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PREFACE 


In presenting the fifth volume of the Annual Review of Psychology we de- 
sire to express our appreciation of the thoroughgoing co-operation of profes- 
sional colleagues here and abroad who have prepared the chapters for this 
and the previous volumes of the Review. Evidences of their generous good 
will in accepting invitations and of thoughtful planning and perseverance in 
the task of bringing to their readers a representative, coherent digest of in- 
teresting contributions have been most heartening. To the authors, chiefly, 
is due the major credit for our having met each essential deadline, culminat- 
ing in on-schedule publication of the five volumes. 

During the preceding vear, important changes took effect in the personnel 
of the Editorial Committee. Keenly noted was the loss of Donald W. Taylor, 
Associate Editor, whose resignation was accepted with regret at the be- 
ginning of the year to facilitate his discharging inescapable obligations con- 
nected with his research and other professional commitments. Beginning 
in 1947, Dr. Taylor co-operated in an advisory capacity on plans for the first 
volume of the Review. In 1949 his official work as Associate Editor began. 
During a six-year period his interest has been unflagging and second to none 
have been his constructive ideas on delimitation of chapters, choice of sub- 
jects for occasional review, nominations for chapter authorships, editorial 
practices, and a host of other details requiring official consideration. With his 
loss, we are happy to announce that Dr. Quinn McNemar of Stanford Uni- 
versity has accepted the invitation of the Board of Directors to succeed Dr. 
Taylor, beginning in January, 1954. 

During the present year Dr. Taylor kindly consented to edit the chapters 
on Vision and Audition; Dr. McNemar, the chapters on Statistics, Individual 
Differences, and Theory and Techniques of Assessment. Each was available 
as consultant on other chapters. The Editor is also indebted to Miss Marian 
Ballin, Graduate Student, for her careful work with galley for twelve of the 
chapters. Her wide acquaintance with psychological literature and technical 
training in editorial conventions greatly expedited the Editors’ work. 

Still other changes in make-up of the Editorial Committee in 1953 include 
the retirement of Dr. J. G. Miller, who completed his term of service, and the 
resignation of Dr. C. H. Graham one year before his term had expired. Hav- 
ing committments outside the States, he suggested that he be replaced by 
one who could participate more actively in the work of the Committee. His 
successor for the unexpired term is Dr. Clifford T. Morgan who will continue 
after 1953 for a regular five year term. 

With the presentation of Volume 5, it is a pleasure here to take note of the 
informed, painstaking approach that has characterized the work of our 
editorial assistants who, throughout each vear, perform many nonacademic, 
recurring functions that are essential to the production of the Review. When 
manuscripts arrive they collaborate in numerous ways to unify the joint 
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efforts of authors and Editors. The Editors would individualize their in- 
debtedness to Ruth Swan and Carol F. Kupke for providing the ““know-how”’ 
to make the initial Review in psychology conform in style and format with 
those of the other well-established Annual Reviews. Other editorial assistants 
no less conscientious in discharging essential duties, slightly narrower in 
scope but none the less important, were Barbara Darniel, Joyce Fairweather, 
Robbie Bass, and Lillian Rutherford. The constant interest of our editorial 
assistants in each volume’s receiving a favorable reception contributes in- 
numerable emendations from which the readers derive many tangible bene- 
fits. 

J.E.A. C.T.M. 

N.D.C. Q.M. 

J.G.D. R.L.T. 

CPS. 
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CHILD PSYCHOLOGY!” 


By HELEN L. Kocu 


Department of Psychology and Committee on Home Economics, The University of 
Chicago, Chicago, Illinois 


GENERAL CHARACTERIZATION AND EVALUATION OF THE FIELD 


Writings that might be said to fall under the classification of child psy- 
chology have been located in the output of many disciplines, pure and ap- 
plied. This state of affairs is no doubt as it should be, for problems seldom fall 
neatly within discipline lines, and interest in the behavior and welfare of the 
child should be widespread. But a heavy price is paid for the interest child 
psychology enjoys, for much of the exploring is in inexperienced hands. Often 
clumsy conceptualizations and procedural blunderings could have been 
avoided had there been acquaintance with the best that psychology and con- 
tributing disciplines had to offer. But what is more serious, hypotheses and 
faiths are confused with facts. As is usual in popular thinking, practical con- 
siderations are the focus of interest, and the implications of even well- 
founded principles are elaborated without any attention to the complexity 
of phenomena. Too many studies, though not without some merit, have little 
foundation in systematic theory. Their goal seems to be rather to observe} 
and to see what ‘‘turns up.’’ In the main, psychoanalytic interpretations are) 
accepted uncritically as are also those based on projective tests. i 

It may be significant that the writer could not find in the products of the 
year the presentation of any new principle of substantial scope, and there) 
were only about a half-dozen investigations that contributed to the clarifica-\ 
tion of even the semidescriptive old ones. These latter studies, since they are’ 
rare, have been singled out for special comment. ' 


Basic CONCEPTS AND DESCRIPTIONS OF 
BEHAVIORAL DEVELOPMENT 


Riesen & Kinder (93), through the intensive observation and controls 
which only the use of animal subjects permits, have subjected to checking 
such descriptions of development as Gesell has popularized or formulated. 
Riesen & Kinder’s subjects were 13 infant chimpanzees. The two authors 
question the usefulness for understanding the detail of human behavioral 
development of the phylogenetic law, though their skepticism is not with 
respect to the validity of the law. They find themselves at least not in dis- 


1 The survey of the literature to which this review pertains was completed May 1 
1953. 

2 The following abbreviations have been used in this review: WISC (Wechsler 
Intelligence Scale for Children); EEG (electroencephalogram); MMPI (Minnesota 
Multiphasic Personality Inventory); M-F Test (Masculinity-Femininity Test); TAT 
(Thematic Apperception Test); PMA (Thurstone Primary Mental Abilities Test). 
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agreement with the principle of alternating flexor and extensor dominance, 
albeit wanting for the present to suspend judgment. They question the prin- 
ciple of a rigid maturational sequence unless the sequence is composed of 
behaviors which develop far apart in time. No one of their subjects, for in- 
stance, followed exactly the 40-item sequence taken from the Gesell-Ames 
Schedules. Riesen & Kinder wonder if the description that development 
proceeds in a cephalocaudad direction applies beyond the prenatal period. 
They suspect that the so-called postnatal cephalocaudad sequence may be 
reducible to a complexity sequence. Their observations are not entirely con- 
gruent with the Coghill principle that the ‘‘part individuates out of the 
whole.’’ Rather, Windle (118) is given some support. The independent action 
of parts, they think, depends on many variables, among which are stimulus 
characteristics. Disappearing reflexes were observed but of the current inter- 
pretation of these in terms of cortical maturation and the overlaying of 
lower-center functions Riesen & Kinder are not sure. They feel that evidence 
to date has not eliminated learning as the mechanism causing the disap- 
pearance. When they observed their animals daily, they witnessed no de- 
velopments they would describe as saltatory. It is worthy of note in_relation 
to the nature-nurture issue that chimpanzees reared by humans and those 
cared for by their own mothers did not differ significantly in their perform- 
ance on the developmental items of the Gesell-Ames Schedules. 

The Windle-Coghill controversy mentioned above and thought important 
because of its relation to the law that development proceeds from the undif- 
ferentiated to the differentiated received some attention from Humphrey 
(55) who, in a carefully documented theoretical article, defends the thesis 
that there are two kinds of reflexes, the monosynaptic stretch reflexes and 
the plurisynaptic cutaneous reflexes. These two classes, she thinks, are af- 
fected differentially by oxygen lack. Humphrey believes that workers who 
saw only generalized responses among the earliest the human organism made 
missed the localized stretch reflexes because of observing when the oxygen re- 
duction was sufficient to dissipate them but still insufficient to block the 
plurisynaptic reflexes. 

One of the important questions in psychology is the nature of the aspects 
of things to which organisms respond. The Gestaltists contend that animals 
respond chiefly to relations, to Gestalts; they offer the results of trans- 
position experiments in support of their contention. Spence (106) has ques- 
tioned this evidence. Hunter (56) offers a critical investigation bearing on 
the Kéhler-Lashley-Spence (68, 71, 106) controversy. He concludes that 
the ‘‘absolute”’ theory of Spence is inadequate to account for the findings of 
his study but expresses the opinion that response is to both relational and 
absolute aspects of stimuli. Which discrimination will serve as the basis of a 
particular transposition is a function of many variables. 

The kinds of relations cognized and abstracted were questions also of in- 
terest to Werner & Kaplan (117). These authors investigated how children 
of different ages handle the problem of inferring from context cues the mean- 
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ing of strange words. The detail in the monograph is rich but cannot be 
specified here. The general developmental trend in the thinking of the chil- 
dren was in accord with principles Werner had enunciated earlier: i.e., from 
concrete to abstract; from syncretic to differentiated, precise, and independ- 
ent. 

R. R. Sears and the group of workers gathered about him have long been 
concerned with making child psychology a more deductive and less inductive 
science. They have been interested in testing the extent, on the basis of 
chiefly neo-Hullian principles of learning, to which they can deduce personali- 
ity characteristics and the behavior of children from knowledge of the be- 
havior of their parents, especially the latter’s reward and punishment prac- 
tices. Nowlis (85), one of Sears’ cohorts, has described in more detail than 
has been given before, the goals, procedures, trials, and tribulations of the 
“‘Towa”’ study of the relation of maternal behavior and child aggression and 
dependency. For those who tend to interpret a complex situation glibly in 
terms of a single principle, it might be pointed out that the original prediction 
the Iowa group made, namely, that children most frequently punished for 
aggression at home would show the least aggression at school, was not borne 
out. Only by supplementing the principle of reinforcement with others, such 
as the drive heightening effect of conflict, could predictions be made that 
were congruent with the curvilinear relation observed between the amount 
of punishment the children received for aggression at home and the amount 
of aggressive behavior they displayed at school. 

Freud believed that the individual in his libidinal development normally 
goes through a series of stages, oral, anal, phallic, which are characterized 
by certain interests, dominant satisfactions, attitudes, and patterns of ad- 
justment. An individual, according to Freud, may remain fixated at any 
one of these levels, or having tried unsuccessfully to function at a more ma- 
ture level, may fall back (regress) to a less mature mode of functioning. 

Two major investigations aimed to determine whether the attribute com- 
plexes Freud imputed to persons fixated at, or regressed to, a certain level 
can be discovered in the population. Barnes (9) included in the qualities and 
attitudes he measured (crudely) in a group of male college students the prin- 
cipal conscious attitudes Freud ascribed to the various fixation and regres- 
sion types. These measures Barnes subjected to a factor analysis. While 
there are three factors out of eleven in the data which might, if interpretations 
were elastic enough, be viewed 4s those of oral, anal, and phallic character, 
most of the traits Freud alleges to be characteristic of a type are not weighted 
with the appropriate factor, though some important ones are. Efforts to take 
into account reaction-formation did not improve the picture. It may be, as 
many have thought, that Freud was a better observer than theorist and that 
he did in a hazy way differentiate some common personality types but was 
in error as to the determinants of the types and so in relation to the nature 
of the total picture. Barnes’ data, of course, shed no light on the origin of the 
factors uncovered. 
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Blum & Miller (13) concentrated their study on the so-called oral type. 
Like Barnes, they offer no developmental or biographical data on their sub- 
jects but rather merely asked the question do the traits Freud attributes to 
the oral character really tend to cluster in some school-age children. Blum & 
Miller observed that their measures of food intake and food interest, social 
isolation, concern over giving, need for being liked, and tolerance for bore- 
dom were correlated to some extent. However, measures of suggestibility, 
depression, need to be ingratiating, and inability to divide loyalties were not 
associated with the aforementioned cluster. This is the same pattern of par- 
tial agreement, partial disagreement with Freudian theory that Barnes has 
noted. Blum & Miller did not see fit to consider explanations for their findings 
that would be alternative to the Freudian. 

Friedman (43) has been concerned with the castration and Oedipus com- 
plex concepts which psychoanalysts consider extremely important for the 
interpretation of development. He has subjected to checking two assertions 
said to follow from the two concepts: (a) that castration fears (and hence re- 
pression of them) are high at about five years and then again at early adoles- 
cence and (6) that girls, after experiencing the castration complex, turn in- 
creasingly toward their fathers and away from their mothers. Friedman used 
a story completion test, containing stories about objects which, according to 
Freudian lore, are penis symbols (e.g., tails). The stories were developed in 
such a way that one completion theme that might readily occur would be the 
removal of the organ. If the story completion in terms of mutilation was not 
given, then it was assumed likely that repression, motivated by castration 
fears, was responsible. Friedman reports that, consistent with Freudian 
theory, the mutilation theme occurred less frequently in the story comple- 
tions of 5-year-olds, especially boys, and in 11 to 14-year-olds than in the 
stories of children 7 to 10 years of age or over 14. It should be noted, however, 
that in presenting significance data. Friedman chose the particular age com- 
parison most likely to yield a significant p. The comparison chosen varied 
from story to story. It would have been interesting, too, to have determined 
whether stories suggesting the mutilation of living things that are not re- 
garded as penis symbols (or are there any such?) were less frequently 
forthcoming from children of certain ages than others. 

Likewise with a story completion and a picture story test, Friedman 
gauged the parent preference of his subjects. Evidence of a mother prefer- 
ence by the girls was yielded by one technique but several other, less direct, 
approaches suggested the opposite. Hence, Friedman thinks his results are 
congruent with Freudian theory. The possibility of a cultural interpretation 
of his findings is not considered. 


PERSONALITY ORGANIZATION AND DEVELOPMENTAL TRENDS 


Trait persistence—The extent to which personality traits tend to persist 
in an individual through the years is a question of much significance. If per- 
s'stence is observed, it is usually interpreted in terms of genetic influences, 
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constant environmental forces, powerful abiding effects of early happenings 
(such as brain damage, or mental trauma). Seldom, however, have we been 
able to disentangle these influences, and relevant studies this year succeed no 
better. Despite lack of success in interpretation, it is of some interest merely 
to know what traits tend to be rather stable. Studies bearing obliquely on the 
issue [Blum & Miller (13); Barnes (9)] have already been mentioned. Smith 
(103) approaches the problem more directly, though her study lacks in ade- 
quate controls. Using a detailed diary kept by a mother relative to the activi- 
ties of her six children (three males and three females), Smith rates these six 
individuals (in some cases also has them rate themselves and each other) 
as young children and then as elderly adults. The rank on the basis of age at 
which learning to talk and to read took place agrees perfectly with adult 
rank in academic achievement, and the estimates of childhood IQ are corre- 
lated perfectly with occupational status in maturity. Thirty-seven per cent 
of the childhood character and personality ratings agree with those made of 
the adult, and 33 per cent differ by no more than one point on the five-point 
scale. How much better than chance this is, Smith does not determine. Nor 
can we be sure of the independence of the judgments of adult and childhood 
status or of their relative freedom from bias. At any rate, among the appar- 
ently more stable traits were found affectionateness, sympathy, conscien- 
tiousness, carelessness, ambition, intelligence, contrariness, irritability, jeal- 
ousy, quarrelsomeness, spunkiness, strength of will, nervousness, and physi- 
cal attractiveness. Consistent with Tuddenham’s (112) observation, Smith 
noted that the boys in the family improved more in status than did the girls. 

Physique and temperament are believed by many to be related. Hence, al- 
though the investigation is only tangential to the psychology of personality, 
it seems appropriate to call attention here to Reynold’s (92) excellent one on 
the distribution of subcutaneous fat in childhood and adolescence. Reynolds 
analyzed roentgenographic measures of subcutaneous fat in eight body 
areas, the measures taken annually for 12 years on 176 children. As evidence 
bearing on the stability of adipose character, the inter-age correlations may 
be cited. The correlations between measures procured at 7.5 and 13.5 years 
range, depending on the area assessed, from +.38 to +.83 for girls and from 
+.41 to +.71 for boys; while the correlations between measures taken at 
11.5 and 16.5 years scatter from +.40 to +.71 in the case of girls and from 
+.61 to +.82 in boys. These coefficients indicate some stability in body type. 
The inter-area r’s, while positive, are low enough to bespeak of striking 
differences in build. Suggestive of but by no means proof of genetic correlates 
of individual differences in adiposity and in adiposity patterns, are the facts 
(a) that Reynold’s subjects tended to resemble their parents (r, father-son 
=-+.48; r, mother-daughter=+.35 for measures of subcutaneous fat on 
calf and chest) and (b) the amount of resemblance in adipose character the 
members of pairs of children showed was directly related to the degree of 
closeness of the pair members in genetic relationship. It is probably signifi- 
cant that the children (both male and female) with the greater fat breadths 
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tended to mature earlier, though it should again be emphasized that this 
cannot be taken as evidence of genetic effects. 

Age trends.—Ames this year has added two studies to her significant se- 
ries of normative studies of preschool children. She has provided a catalogue of 
chiefly verbal behaviors exhibited by children 18 to 48 months of age and 
thought to be revealing of the child’s sense of self (3). Lists of typical behav- 
iors are given for half-year intervals, though, unfortunately, no frequency 
or variability data are presented. 

Ames et al. (4) give Rorschach Test norms for half-year intervals from 
2 to 6, and thereafter, at yearly intervals until 10 years. The volume is a sig- 
nificant contribution to the testing literature as well as the literature on 
mental growth. 

Still another chiefly normative study of personality change, though social 
class differences are also focussed, was directed by Frank and co-workers 
(38) and is based on the results obtained with five projective tests. Although no 
systematic population sampling was attempted, 100 each of prepuberal, 
puberal, and adolescent girls, representing various social classes and circum- 
stances, were the subjects. Some of the chief conclusions of Frank et al. are: 
(a) the adolescent girl, as compared with the younger, is more poised, realis- 
tic, critical of her family, active and interested in men; (b) the American girl 
tends to be somewhat dependent and fears growing up; (c) she is motivated 
by a desire to please more than by an intrinsic well-integrated value system; 
(d) middle-class girls are inclined to be anxious, tense, and ambitious but not 
mature emotionally, even at late adolescence; (e) high intelligence and supe- 
rior school performance tend to block the whole-hearted embracing of the 
feminine role; (f) many lower-class girls have been severely traumatized, 
but those who have not, verge toward a more mature femininity than the 
girls of corresponding age from the middle class. 

Unfortunately, in the case of the reports on some of the tests no quantita- 
tive data are offered, the writer merely giving her impressions and some il- 
lustrative material. Hence the reader is in no position to assess the impres- 
sions and interpretations given. One may also question, on the basis of stud- 
ies such as Samuels’ (97), whether different interpreters would receive simi- 
lar impressions from the data basic to the monograph. Value judgments are 
made freely, judgments which to the reviewer seem at times distinctly old- 
worldish. As is characteristic of diagnoses based on projective tests, the 
‘difficulties’ of the subject receive much attention. Even if one grants the 
diagnoses correct, one may well wonder how a stress sign would be assessed 
if it could be viewed in the context of the individual's total response system 
and the character of the culture in which he must function. 


SocraL CLAass DIFFERENCES 


Attitude and adjustment.—Auld (6), after reviewing the relevant litera- 
ture, asserts that practically all studies show the higher classes to be superior 
to the lower in attitude and adjustment. Social class differences tend to be 
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greater than economic group differences, though the two classifications over- 
lap greatly. Davis (23), of course, would insist that Auld’s is a necessary 
finding, because assessments are made in terms of middle-class rather than 
lower-class values. Although it might be interesting to debate this issue, let 
us merely survey the class differences observed. 

The fact that lower-class children in telling stories interpreted their char- 
acters as motivated by less remote goals than did middle-class children is 
taken by Leshan (74) to mean that middle-class folk, as compared with lower, 
are more inclined to delay pleasure for the sake of future gain. 

Consistent with this view is Ort’s (86) observation that in the case of his 
middle-class group there was less concordance between what youths said 
their parents expected of them and what they did, than was true of the lower- 
class group. The lesser concordance in those of higher social status stemmed 
from the fact that the expectations held for the children involved more un- 
pleasantness, 7.e., less immediate satisfactions, than those held for children 
lower in the social scale. 

To determine directly just what parents struggle for in their children, 
Aberle & Naegele (1) interviewed a group of middle-class fathers. The traits 
stressed by these men, in their hopes for their sons were initiative, respon- 
sibleness, obedience, masculinity, medium aggressiveness, good achievement 
in school, and a business career; for their daughters, lovingness and docility. 
Marriage was the chief goal for the girls. 

Lest comments on this point be taken as suggestive that life for the lower- 
class child is rosy, attention should be paid to studies like that of Drucker 
& Remmers (28) and Pattie & Cornett (87). The lower-class group among the 
seventh-grade children who the former authors had check the SRA-Youth 
Inventory indicated they had more basic personal problems than did the 
middle- and upper-class children. This is in agreement with the trend in the 
Frank (38) monograph. The Inventory, it should be added, was validated 
against school counsellors’ judgments of the children’s adjustments and was 
found to discriminate between extreme groups reasonably well (29). 

Pattie & Cornett asked children from both underprivileged and favored 
neighborhoods to recall as many as possible memories of their life before five 
years. The proportion of unpleasant memories recalled by the children from 
the unfavorable environments was about twice as great as that for the chil- 
dren from the favorable. On the basis of this fact, Pattie & Cornett warn 
against the common practice of diagnosing as pessimists individuals who re- 
call relatively more unpleasant experiences than do others. Experience and 
background need to be taken into account. Of course, one might raise the 
question of what does make a pessimist if not his experiences! 

Turkish children from poor homes were more generous in a sharing test 
than were the economically more favored, according to Ugurel-Semin (113). 

Concepts.—Estvan (34) questioned upper- and lower-class children (ages 
10 to 11) to discover their conception of the extent and implications of pov- 
erty. In conception of extent the groups did not differ, but they did differ in 
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their notions of the way the problem could be dealt with. As far as implica- 
tions were concerned, the lower-class children stressed quantity lacks; the 
upper-class, quality lacks. 

Language.—Children from the upper classes have long been known to 
_ have language skills superior to those from the lower. Consistent with this, 
is Irwin’s (58) observation that after 18 months even the children from the 
professional and business groups differ in speed of speech acquisition, the 
* former making the more rapid progress. Relevant, too, is Estes’ (33) finding 
that her subjects from the professional and administrative groups scored 
higher on the WISC? than the children of day laborers, differences being more 
marked in the case of achievement on the verbal section than on the per- 
formance. 

Because of considerations such as the above, efforts are being made to 
develop so-called culture-free intelligence tests (24, 32). 


EFFECTS OF FAMILY ATTITUDES AND PRACTICES 


Effects on social-emotional behavior and attitudes —Though not always has 
the effect of family attitudes on child behavior been well abstracted from that 
of a correlated complex, it is generally agreed that when parent attitudes are 
bad, the children have difficulty making a good social adjustment. But what 
are the bad attitudes? Some clarification of this point may be achieved by 
the summaries which follow. 

Among the several studies dealing with variables correlated with the 
child’s identification with his parents (this is a fuzzy concept) is that of 
Sopchak (105). He asked students to check the MMPI,? first to describe 
themselves, and then successively, as they thought their father, their mother, 
and most people would check it for themselves. Degree of similarity between 
self and parent descriptions was taken as a measure of identification. (How 
satisfactory this method is, we do not know, but it is essentially the same, 
though different instruments were employed, as has been used in the other 
studies on identification we shall mention later.) At any rate, the correlations 
obtained by Sopchak between the identification scores, especially those for 
identification with the father, and various measures of psychotic trends 
which the MMPI yields, were significant and, of course, negative. 

Cava & Raush (16), estimating child conflict in twelfth-grade boys 
from lower-status homes by means of the Blacky Test, found that the boys 
who showed the least identification with their fathers tended also to show the 
most conflict, especially in the castration fear area. 

Cass (15) noted that parent-child conflict (gauged by a sentence comple- 
tion test) was greatest among a group of adolescent girls when the mother 
was low in awareness of the child’s thoughts and feelings and high in re- 
strictiveness. The measures of child identification with the mother when cor- 
related with the mother’s ‘‘awareness” score yielded a value of +.59 in the 
case of the girls, and +.17 in the case of the boys. 

Schoeppe et al. (98) discovered that success in achieving emotional in- 
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dependence at adolescence and also in sex-role performance is significantly 
associated, in the case of males, with strong identification with the father. 
Severe parental control was associated in the population studied with poor 
performance in all the major developmental tasks of adolescence. Schoeppe 
et al. have based their conclusions on a great volume of material obtained 
in an intensive inter-disciplinary study of the adolescents of a given age in 
a small midwestern community. 

The reverse of the picture just presented is described in the conclusion by 
Montalto (83) that her six-year-old subjects whose mothers were warm in 
affection and low in restrictiveness were the most stable. Montalto based her 
estimates of child adjustment on the Rorschach Test and her judgments of 
mothers on the Fels Child Research Station parent ratings. Montalto be- 
lieves the various combinations of high restrictiveness, low restrictiveness, 
warmth, and coolness in the mother result in quite different personalities in 
the children. 

To reflect the young child’s view of its mother’s functioning and its reac- 
tion to this, wishful and otherwise, Isch (59) used the Sears doll-play pro- 
cedure. Each of 33 pre-schoolers had been observed with its mother in an 
experimental session. The children whose mothers in the experimental 
session had been aggressive with them tended in the doll play to fantasy the 
child doll or the mother doll the recipient of much aggression. If the mother 
had tended to ignore the child, then the child doll was fantasied as receiving 
aggression from the mother, whereas if the mother had been highly directive, 
then neither the child doll nor the mother doll was fantasied as aggressive, 
but the child doll was portrayed as highly directive and negativistic. The cor- 
relations between measures of the child’s actual behavior and his fantasy 
tend to be insignificant or very low. 

Effects on intelligence and language.—As evidence of the interest-stimulat- 
ing effects of a good home, Fischer (35) presents data on institution babies. 
Several groups tested first at six months each averaged about 76 in IQ. All 
of the babies gained on retest. A group tested after adoption scored normal. 
Unfortunately the babies remaining in the institution were not tested at the 
same time; so it is difficult to tell whether the institution continued to be 
depressing. Irwin (58), in the same key, reports institution babies slow in 
speech acquisition. 

Presenting the reverse side of the picture, Klatskin (66) announces that 
a group of 184 babies who are participating in the Yale-Senn program, 
hence receiving good pediatric care and with parents being counselled to be 
permissive and flexible, scored above average on all but five of the items of 
the Cattell Infant Intelligence Scale when one year old. Sixty-one per cent 
of the group walked at 12 months, as compared with only 28 per cent of the 
babies used to establish the Gesell norms. The lack of acceleration on the 
verbal items leaves one wondering. Does this illustrate Shirley's (101) con- 


tention that an early enthusiasm for motor activities may delay the onset of 
speech? 
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Mowrer (84) takes the view that, as in the case of talking birds, devotion 
by the baby to the person who cares for it, is highly important for the de- 
velopment of good motivation for speech learning. 

Probably illustrating the effect on speech development of parental neg- 
lect and abuse is the case of extreme speech retardation which Du Pont e¢ al. 
(30) describe. The speech difficulty in this eight-year-old boy cleared up 
considerably under simple nondirective therapy with no special attention to 
speech correction. 

Effects of feeding practices.—Since some of the psychoanalysts painted 
very distressing pictures of the effects of early feeding frustrations, there 
has been much debate on this topic. Already we have mentioned the indirect 
approaches to the question by Barnes (9) and by Blum & Miller (13). Sewell 
(99) has taken a more direct route. He found no relation, however, between 
the type of feeding and weaning experience of his 161 child subjects while in- 
fants and their present self attitudes as measured by the California Test of 
Personality, ratings their teachers gave them on independence, emotional 
control, acceptance of authority, etc., or their mothers’ reports of nervous 
habits, speech difficulties, sensitiveness, or fears. Rosenthal (94), on the 
other hand, comparing the mothers of a group of babies suffering from 
eczema with the mothers of a parallel group free from the disorder, reports 
that the former comforted and handled their babies little. We should like to 
see the investigation repeated with better matching and control of examiner 
bias. 

The Wolfenstein (119) historical account of the trends in child feeding 
and infant care in the United States from 1914 to 1951 (the various editions 
of the Children’s Bureau pamphlet, Jnfant Care, was the source material) 
made one view with some misgivings his present faiths, since the fashions 
during the last 35 years have been many. The present fashion is favorable to 
a child-cued program and highly permissive treatment. 

Family effect on social goals —Ambition or mobility was the subject of an 
investigation by McGuire (78). The offspring in socially mobile families 
and from families in which the two parents had different social-class back- 
grounds tended to be mobile, the incidence of mobility being 62 and 72 per 
cent, respectively, in the above mentioned groups and only 15 per cent in the 
offspring of nonmobile families. 

Family effects on ethnic attitudes—Goodman (45), in a very convincing 
study, using testing, intensive interviewing, and long observations of both 
parents and children, demonstrated a considerable degree of race conscious- 
ness among both Negro and white preschool children in mixed schools. Some 
of the awareness and attitudes stemmed clearly from parent teaching. Forty 
per cent of the Negroes and 24 per cent of the white children showed high 
race awareness; 15 per cent of both groups, very low awareness. About three- 
fourths of the Negro children would have preferred to be white. About one- 
fourth of Goodman’s 103 four-year-old subjects exhibited a strongly en- 
trenched system of color-related values. 
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Frenkel-Brunswik & Havel (41), Bird et a/. (12), and Helfant (53) sketch 
the prejudice picture of the school-age child. All report some, but only a 
small-to-medium relation between the ethnic attitudes of parents and their 
children. Prejudice was fairly systematized and generalized in both and, ac- 
cording to Bird et al., not related to the social, educational, or occupational 
level of the family or its church affiliation or attendance. Generally the 
culturally less favored were less prejudiced against the culturally more 
favored group than the latter were to the former. 

Brunswik and Havel raise, but do not answer, the question as to whether 
prejudice is learned chiefly through direct and indirect teaching by family 
and peers or results from mechanisms such as displacement of hostility. 

Ordinal position and sibling influence-—Studies in this area are few. Com- 
menting on the Fels Research Institute findings, Lasko (72) states that 
mothers tend to be less warm with their first child than with later ones, 
though at the same time more child-centered and protective, at least during 
the child’s first two years. 

In a sharing test the only child was more selfish, on the average, than the 
child with sibs, according to Ugurel-Semin (113). Refutation of the idea 
that sibs strongly stimulate development in speech comes from Irwin (58). 

Evidence of the effects of sibling rivalry was obtained by Lees (73). 
Among the adult miners who were given the opportunity for some college 
training, first-borns made especially good use of these opportunities, i.e., did 
well in their studies and improved their occupational status, if they had sibs 
who had done better than they socially or economically; otherwise, accom- 
plishments were moderate. The performance of the second-borns seems to 
have been unrelated to their sibs’ life status. Markedly fewer of the first- 
borns in the group had married, and they were less sociable than the second- 
borns. 


PEER RELATIONS 


McGuire (79) contends that within the first half year after entering 
school, children have established their peer allegiances, and these tend to 
persist through the rest of the year. The friendship groups are composed 
usually of children of one sex. While a certain degree of general social con- 
formity seems important for peer acceptance at these ages, the child who is 
too conforming and adult-centered is not popular. 

Second-graders, when citing their reasons for liking a classmate, listed 
external characteristics, such as good looks or a nice home, according to 
Dymond et al. (31); whereas sixth-graders most frequently mentioned 
personal qualities, such as friendliness and cheerfulness. 

In the studies listed below we have illustrations of four different methods 
of gauging a child’s acceptability to his peers. Tuddenham (112) thinks his 
version of the ‘‘guess who”’ test furnishes good clues as to social adjustment. 
It indicates who accepts and who rejects the child and for what reason, 
whether those the child admires also admire him, whether he is generally 
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approving or disapproving of his classmates, and whether his view of himself 
is consistent with that of others. 

Clark & McGuire (18) have devised two instruments to gauge peer 
status. The basic data in both cases are derived from a standard sociometric 
procedure. Each positive and negative mention a child receives is then 
weighted, in the case of one instrument (80), by the arbitrary weight given 
to the social class of the child making the mention; in the other instrument 
(18), by the popularity in the class of the child making the mention. 

Believing that the degree to which a school-age child has pregenital at- 
titudes would be inversely related to the degree of his acceptability to his 
classmates, Miller & Stine (82) devised a story completion test and scoring 
scheme to reveal pregenital attitudes. The test was validated against a socio- 
metric test criterion. The scores of the two instruments correlated negatively, 
as expected, but ‘‘anal’”’ attitudes seemed to be associated with popularity. 
This latter fact may remind the reader of Davis’s (23) contention that many 
so-called ‘‘anal’’ characteristics, e.g., thrift, cleanliness, thoroughness, are 
traits highly valued by the middle class. This is the class to which most of 
Miller & Stine’s subjects belonged. 

The hypotheses of Dymond e¢ al. (31) and Ausubel & Schiff (7) that em- 
pathy, the ability to know how others think and feel, is significantly corre- 
lated with popularity and with age were verified. Self knowledge, according 
to Ausubel, was slightly less accurate than knowledge about others. 


THE SEXES 


In a nicely conceived study of peer judgments, Tuddenham (112) em- 
ployed a ‘‘guess who” technique with subjects from grades one to five. He 
reports the following findings: girls had the reputation among their class- 
mates of being quiet, docile, and friendly; boys, of being aggressive, bold, 
and active. Girls generally were more approved than boys, but the boys’ 
stock went up with age; the girls’, if anything, went down. The children, as 
a rule, favored their own sex. The boys cast proportionally more favorable 
votes for their peers; the girls giving both more positive and more negative 
votes, a finding somewhat in line with that of Sherriffs & Jarrett (100) that 
women tend to more extreme grading than men. The boys were inclined to 
agree better with each other in their estimates of a given classmate than were 
the girls. The individual pupil’s judgment of himself in various traits was not, 
in the main, in good accord with his classmates’ assessment of him, but the 
degree of agreement increased with age, a trend similar to that reported by 
Ausubel & Schiff (7). The teacher’s judgment of a given child and that of his 
classmates agreed only moderately, the extent varying with the trait and, 
as Ausubel & Schiff also noted, inversely with the age of the child. : 

Items related to the success individuals have in the major developmental 
tasks of adolescence vary somewhat with sex, according to Schoeppe et al. 
(98). Positive feeling for boys, for instance, was important for the success of 
the girl in her sex role in the population studied, whereas the reverse seemed 
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not to hold for boys. Close conformity to the social code appeared to have 
been more important for girls than boys. For boys, success in the achieve- 
ment of emotional independence was related to the adoption of the morality 
system of the father; while for girls, no significant relation with this latter 
variable was noted. 

The idea that masculinity or femininity may bea biosocial complex is sug- 
gested by the data of Schoeppe et al., and is borne out further by the findings 
of Ford & Tyler (36) who factored separately the Terman-Miles M-F? 
scores obtained from 153 female and 157 male ninth-graders. Two factors 
were adequate to account for the communalities in the responses of the 
males; three factors, in the responses of the females. The patterns of item 
loadings suggested that the factors, in the case of the boys, may be tough- 
mindedness and interest; in the case of the girls, sensitivity, interest, and so- 
cial role. The latter factor seems consistent with the observation of Schoeppe 
et al. that conformity to the behavior code approved by the culture for fe- 
males is very important for the girls. 

Perhaps the findings of Veroff et al. (114) reflect some of the detail of this 
behavior code. These investigators gave a picture-story test depicting in- 
dividuals at work to a group of 16- to 18-year-olds. More achievement themes 
occurred in the girls’ stories when the picture contained a male than whena 
female figure, whereas the boys showed no such difference in reaction. The 
girls, moreover, did not, as the boys did, increase achievement themes when 
put under pressure to do well. It is the opinion of the investigators that their 
results support Mead’s (81) idea that, at adolescence and after, the female 
stops competing with the male but rather competes and achieves through 
him. 

Sex differences in race awareness among preschool children were noted 
by Goodman (45), the girls being the more race conscious. This seems in 
line with an oftrepeated finding that girls have stronger social interests than 
boys. 

Following are some of the sex differences found by Ames et al. (4) in the | 
responses of young children (two to ten years) to the Rorschach Test. R, A%, 
and A were higher for boys than girls; F, FC, and H%, lower. No consistent 
sex differences were apparent in D, F%, M, FM, m, W%, C, or Clob. No 
attempt will be made here to suggest what these differences may mean. J 

The view that a person who draws first, in the Draw-a-Person Test, a 
human figure opposite in sex to his own is a sexual invert has been challenged 
by Granick & Smith (48). The third of their female subjects who drew the 
opposite sex figure first and the eighth of their male subjects, did not differ 
in M-F score on the MMPI from those who drew the like sex figure first. 
The M-F scores of the two sexes in each of the aforementioned two groups 
differed significantly and to the same degree. All investigators [Weider &} 
Noller (116); Knopf & Richards (69); etc.] agree, however, that the two sexes 
tend to draw first the figure of their own sex. Knopf & Richards hold el 
opinion that deviation from the trend is associated with anxiety in the child. 

(See also Lorr & Jenkins (76) below for other findings on sex differences.) 
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DEVIANT BEHAVIOR AND ATTITUDES 

Among the deviant groups on whom attention will be centered are the 
children with behavior problems, the delinquent, the school failure, the 
neurotic. Discussion of psychotic children will be omitted. 

To determine whether any order underlies the various behavior difficul- 
ties children fall victim to, Lorr & Jenkins (76) made a factor analysis of the 
Ackerson data on the forms of deviant behavior reported for a large popula- 
tion of children brought to a behavior clinic for help. The analysis yielded 
five common factors which the investigators interpret as follows: encephalitis 
or brain damage, unsocialized aggression, socialized delinquency, over- 
inhibition, and schizoid character. The second-order factors Lorr & Jenkins 
label rebellion and schizoid disorganization. The analyses for the two sexes 
revealed slightly different patterns of factor weights, but the same factors. 
Male socialized delinquents, for instance, are more likely to steal, females to 
be untruthful. The overinhibited girl is likely to cry frequently; the boy not. 
Among the unsocialized aggressives, the girls tend to have tantrums, the 
boys to be destructive. 

For the purposes of organization we shall follow Lorr & Jenkins’ classifi- 
cation as far as possible. 

Brain conditions and problem behavior—A number of investigators 
{Lorenzo & Sidwell (75); Gunnarson & Melin (50); Kennard (64)] agree in 
finding a greater incidence of abnormal brain wave patterns in behavior 
problem children, and all are in accord, too, that brain damage can be asso- 
ciated with an abnormal EEG? and with behavior disorders. There is not, 
however, consensus as to what an abnormal EEG means. Kennard, for in- 
stance, after reviewing the literature, comes to the conclusion that an ab- 
normal EEG (especially the pattern of high voltage fast waves) may be mere- 
ly the internal correlate of emotion. She thinks there is no evidence to 
justify the view that once an individual shows a deviant brain wave pattern 
then he can be expected to continue to show it or that the abnormal pattern 
is necessarily a sign of a defective brain or constitution. Lorenzo asserts that 
an abnormal EEG was characteristic of more of the behavior problem chil- 
dren he worked with than any other physical, neurological, psychological, or 
sociological condition he explored. He admits that an “abnormal” family 
history increased the probability of an abnormal EEG ina child, a fact which 
may, of course, be interpreted to support either side of the controversy 
Kennard specifies. 

It is frequently assumed that the abnormal EEG stems from brain dam- 
age received at birth, as from anoxia, among other things. Significant in this 
connection is the demonstration by Hurder & Sanders (57) of very high 
resistance by newborn rats to impairment of learning ability from severe 
anoxia (adult rats had not shown this). This finding is in line with current 
pediatric faiths. 

Apropos of the problem of the relation of brain damage and behavior, 
one might mention the report by Keeler & Bender (63) on a 3- to 15-year 
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follow-up of 20 children originally admitted to Bellevue Hospital with 
Sydenham’s chorea. In the years following their release from the Hospital, 
11 made a very poor social adjustment and 6 more only a fair one. Among 
those who made the poorest adjustment were those with evidence of per- 
manent brain damage, with a low IQ, or a home of poor quality. 

Delinquent and aggressive behavior —The year’s most extensive treatment 
of the problem of the aggressive child is given in a volume by Redl & Wine- 
man (91). Believing extreme aggressiveness results from ego disturbances, 
these authors present their theories relative to what occasions the break- 
down of controls or what interferes with their development, picture in vivid 
detail the defense mechanisms typical of the delinquent, and sketch the meth- 
ods found useful in helping the residents of a home for disturbed children to 
develop effective ‘‘controls from within.” 

The extrapunitiveness Redl sees to be common in the aggressive child 
shows clearly in Rosenzweigs’ (95) study of a group of clinic children. On the 
Picture-Frustration Test the clinic children gave more extrapunitive and 
fewer impunitive responses than normals. 

That the aggression frequently found in habitual delinquents tends to be 
widely generalized is suggested once again by Wattenberg & Faigenbaum’s 
(115) observation that there is little consistency in the type of offense in 
which a given individual indulges except in the case of the sex delinquent 
and auto thief. 

Although delinquency is a legal category and no uniform personality 
characteristics mark the group, Gough & Peterson (46) are of the opinion 
that delinquents rather generally lack the ability to look at themselves as 
social objects, in Mead’s terms, to take roles. On the basis of the theory, 
though it must be admitted some items quite irrelevant to the theory were 
included, Gough & Peterson constructed a questionnaire which was success- 
ful in identifying 78 per cent of the adolescent delinquents in one population 
and 66 per cent of the adult offenders in another. Twenty-three per cent of a 
group of normals were misclassified by the instrument. The delinquents, 
when compared with nondelinquent controls, were rated on the average as 
more rebellious, headstrong, defensive, dissatisfied, tense, anxious, wary, 
sensitive, and affected. 

Examples of what might be called socialized delinquency may be found 
in Clark’s (19) investigation of experiences in vandalistic episodes on the part 
of students. The incidence of experiences of this sort among the students was 
higher in males than females. More of the episodes, too, in the case of the 
males occurred after the boys were 10 years old; more of those in the case of 
girls, before 10. The crowd situation, enjoyment of excitement, and hostility 
to the individual or agency whose property was damaged were cited fre- 
quently as motivations for the acts of vandalism. Clark thinks vandalism, 
within limits, is normal and with this Crane (21) essentially agrees. He found 
that 41 per cent of the male gangs and 27 per cent of the female gangs he 
studied were predatory. Crane found gangs were usually one-sex organiza- 
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tions, reached a peak in frequency in the years 9 to 13, and had an average 
size of six to seven members. They tended to have a secret language or code, 
a secret meeting place, and in the case of male gangs, initiation rites. 

It has been presented as a basic law of behavior that whenever an organ- 
ism is prevented from achieving its desires it reacts aggressively. There has 
been much debate in regard to whether aggression is a necessary consequence 
of frustration. Of course, although the issue can receive no final test, it is of 
interest to know with what other kinds of responses organisms do meet 
blocking. On this point Davitz (25) contributes. He systematically exposed 
one group of boys to experiences which stressed and rewarded constructive 
activities and a matched group, to experiences in which aggressive behavior 
was reinforced. Following an imposed frustration, the behavior of the two 
groups was carefully observed. The aggressively trained boys behaved more 
aggressively after frustration than did the constructively trained. The latter 
also behaved generally less aggressively after than before training. 

Neurotic behavior —Anna Freud (42) gives a revealing review of some of 
the kinds of insults, fears, and dependency stimulations children often have 
when hospitalized or merely when ill at home. With a view to determining 
some of the psychological effects, especially the persisting ones, of an opera- 
tion, Jessner et al. (60) observed 142 tonsillectomy and adenoidectomy cases 
before, during, and after the operation. Twenty per cent of the children had 
a severe persisting reaction, those children most upset by the operation ex- 
perience being usually the ones who were disturbed individuals before. 
Symptoms observed by Jessner et al. in their traumatized subjects were 
nightmares, refusal to talk, tics, fears of white coats and of death, wetting 
and soiling, screaming, and increased dependency. Since the information the 
child had about the operation beforehand was not related to the severity of 
his reaction to the event, Jessner and his co-workers think information needs 
to be supplemented by other experiences such as acting out, perhaps. 

‘“‘Accident proneness,’’ which has frequently been interpreted as a neu- 
rotic sign, was the subject of an investigation by Krall (70). She located, 
through hospital records, 32 five- to eight-year-old children who had had at 
least three major accidents in four years. These children were observed in 
two doll-play sessions and their behavior compared with that of 32 controls 
matched in age and school grade. (Would that the matching had included 
environment or social class!) The ‘‘accident prone,”’ in relation to the con- 
trols, were in the doll-play sessions more active, intense, indulged in more 
verbal aggression, and made more bids for attention. It is significant that 
the social histories of the ‘“‘accident prone”’ revealed that the child in many 
cases was a member of a large family, was late in the birth order, came from 
a broken home, and had transferred frequently from school to school. It was 
suggested that, instead of giving an interpretation of the high accident rate 
of these children in masochistic terms or even in terms of cognitive dediffer- 
entiation, as is usual, one might do well to consider the possibility that the 
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accidents resulted from inadequate supervision and greater exposure to 
danger. 

If nail biting were taken as a sign of neuroticism, then 61.6 per cent of 
Chicago’s public school children of eight years would be neurotic! Malone 
& Massler (77) found the incidence of nail biting in five-year-olds was 38 per 
cent. It increased until eight years when it reached 61.6 per cent and then 
declined. However, at 18 years, 28 per cent of the pupils were still biting their 
nails. After 10 years girls offended less than did boys. 

Since some attribute most feeding difficulties in babies to anxiety gen- 
erated by poor handling, the observations of Jorup (62) on the 30 per cent 
of his 589 infant patients who were dyspeptic will be of special interest. Nine 
per cent of the babies had severe difficulty. The disorder, characterized by 
loose stools, high intestinal pressure, and pain, begins, according to Jorup, 
at birth and is relatively independent of diet or environment. Since anticho- 
linergic drugs give immediate relief, Jorup believes the disorder stems from 
parasympathetic dominance. In support of the theory that the difficulty is 
constitutional, he points to the fact that the parents and sibs of dyspeptic 
babies are frequently sufferers from gastrointestinal disorders. Jorup’s thesis 
is in accord with the fact that rats can be bred for differences in frequency of 
defecation and micturition (when under stress). Since, however, the visceral 
behavior and overt fear behavior are not well correlated, Bindra & Thomp- 
son (11) are of the opinion that what is inherited is a tendency to gastrointes- 
tinal lability rather than, as has been asserted, a tendency to fear. 

Learning difficulties —A great volume of literature indicates that learning 
is influenced by motivation. But what are these motivations in detail and 
what is the pattern of their effects? Gaier (44) was interested in some of the 
characteristics of the rigid learner. He used a unique technique, “‘stimulated 
recall,”” to get a picture of what 11 university students were thinking about, 
in respect to the course subject matter, themselves, and others, during the 
regular sessions of a university class. A tape recording was made of each ses- 
sion, and this was used to stimulate the student’s recall of his thoughts and 
feelings. Also each student was given a battery of tests to measure general 
academic competence, a Rorschach test to gauge his anxiety, rigidity, and 
negativism, and finally the course examination consisting of two sections, 
one composed of ‘“‘rote’’ questions and one of ‘‘thought’’ questions. The esti- 
mates of anxiety and rigidity made by three independent judges correlated 
negatively with all the tests of academic timber and of course learning, ex- 
cept the rote part of the final examination, and correlated positively with 
the amount of derogatory thinking about the self which the stimulated-recall 
sessions exposed. The negativism rankings also correlated positively with 
the amount of critical thinking about self and others in which the students 
indulged during class. Gaier thinks anxiety makes the student rigid and this 
in turn prevents him from dealing in a creative way with new experiences. 
One may wonder, of course, whether the student is anxious and critical be- 
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cause he is having to deal in his courses with material beyond his depth or 
whether the situation is as Gaier outlines. 

Pushing the question one step back, Pearson (88) gives a lengthy review 
of kinds and sources of motivation disturbances as well as organic conditions 
believed to contribute to learning difficulties. Staver (109) and Sperry et al. 
(107), in their study of 15 to 17 poor achievers, point chiefly to parent atti- 
tudes as a source of trouble. The parents of their subjects had tended to dis- 
courage efforts of the child to get answers to his questions and to learn. They 
had explained little and had tried to keep the child dependent. 

Like Staver’s, Kimball’s (65) results point back to disturbed relations 
with parents as a probable determinant of poor scholarship. Her subjects 
were 20 bright, prep-school boys who were failing badly in their studies. A 
sentence completion test, given in an English class, suggests that the boys 
had a poor relationship with their fathers, and the TAT? indicated that they 
had much hostile feeling. What is cause and what is effect, may here again 
be difficult to determine. 

Graham (47), working with retarded readers, believes that most of 
them suffer from some specific blocking which causes them to resist instruc- 
tion. He calls attention to the fact that when his group was tested with the 
WISC and the Wechsler-Bellevue Test, they scored above average on the 
object-assembly, picture-completion, picture-arrangement, block-design, and 
comprehension sections; below average, on the arithmetic, digit-span, digit- 
symbol, information, and vocabulary sections. This is a profile Wechsler 
gives (we wonder with what wisdom) for psychopaths. 

At any rate, support for the view that the motivation to do good work in 
school stems often from quite other than mere interest in the subject matter 
is given by a study by Frandsen & Sessions (37). The investigators correlated 
the grades a given child made in his various school subjects and self-ratings 
on degree of interest in the subjects. These correlations ranged from —.80 
to +.90, depending on the child and, no doubt, still other variables. The 
median correlation for the class was +.51. The children whose grades were 
unrelated or negatively related to the level of their interest were said to be 
extrinsically motivated; the others, intrinsically. There was no difference in 
the general adjustment of these two groups of students, according to the in- 
vestigators. 

Physical handicap and adjustment.—Holding the opinion that no really 
dependable information is available relative to the extent of physical handi- 
caps among children, Barker et al. (8) made a thorough survey of one county 
in Kansas. In this county, 7 per cent of the boys and 4.8 per cent of the girls 
were suffering from a disability of some sort. Ninety per cent of the handi- 
capped attended regular school; 6.5 per cent, no school. Speech and vision 
defects were the most common. The data of the study have many implica- 
tions for education. 

To determine whether a cardiac condition tends to maladjust a child 
seriously, Freed & Cruickshank (39) compared the performance of 71 cardiac 
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cases with 71 matched normals on a sentence completion test designed to 
reflect adjustment. The authors conclude that both groups of children 
showed a good relationship to their fathers but, whereas the normals exhib- 
ited a better adjustment to their mothers than to their fathers, the reverse 
obtained for the cardiacs. The hypothesis is presented that the cardiacs have 
a less good relation to the mother because it is she who must constantly en- 
force the inactivity their heart condition dictates. 

That cerebral palsy may interfere with perceptual as well as motor proc- 
esses is suggested by the finding of Dolphin & Cruickshank (27) that, in a 
tactual discrimination task, a group of cerebral palsy victims were not so 
adept as normals in distiuguishing figure from ground. 

Since it has been asserted, perhaps with not too much foundation, that 
children identify better with the figures on a TAT card if the figures are like 
them in sex, in color, etc., Greenbaum et al. (49) have raised the question of 
whether handicapped children will reveal more of themselves in a picture- 
story test in which the figures pictured are handicapped persons. The in- 
vestigators presented to one group of orthopedically handicapped children 
pictures depicting cripples, and to the other group cards from the regular 
TAT series. The latter the authors believe encouraged more productivity 
than did the revised pictures, although in this latter case 50 per cent of the 
stories touched upon the handicap. 

Mental defect—Porteus & Corbett (90) have reviewed the statutory defi- 
nitions of feeblemindedness in the United States since 1912 and the provi- 
sions for the commitment of children to institutions for mental defectives. The 
authors feel that the provisions often do not insure competent diagnosis and 
tend to stress too much mere test score. Some states have left the diagnosis 
entirely to physicians. 

Of course, many question the wisdom of institutionalizing any but the 
most seriously defective. In relation to this issue, Charles’s (17) study is 
timely. To discover what the feebleminded make of themselves he located 
and investigated 151 of the 206 feebleminded individuals whom Ballers had 
studied in 1935 and who were then over 21. The dullest 20 per cent are still 
institutionalized. The rest differ from the general population in that fewer 
have married, and the married have fewer children; also, their death rate has 
been higher, especially death by violence. Forty per cent of the males have 
violated the law (chiefly because of drunkenness) and 50 per cent have had 
some sort of relief aid in the 17-year period which included the depression 
years. The type of dwelling in which they live and the incidence of home 
ownership is average. Their children are, in the main, proceeding through 
school with little difficulty. Altogether, the picture is a not-too-discouraging 
one. 


COGNITIVE PROCESSES AND MENTAL MEASUREMENT 


Mental measurement.—Table I summarizes some of the inter-test relations / 
explored in the year’s writing. There seems to be agreement that the Stan- | 








2u1310q 
Surddey 
Buroely 
auenfoey pue ysZnouaepoor) 
(WW jo 8013998 + A+S) 
VWd pure ysnouspoor 
VWd put yznouapoos 


satiqy 
[ewe Areaig 
ysnouspoos) 








(s) 


rayeqsuy 








Buipesy eAtssoiZ01g pue 

AqunzeP [ewWeypy eraso0seD 
Supeay ais 

“sa1Z01g pue (Teq12A) OSIM 
Aqinqzeypy [BWI 

BuO Te pue (TINY) OSIM 





projyuejys pure inyuy 
anywy 
pue (s0ueul10j19q) OSIM 
anyyy pue (jeq19A) DSIM 
anyyy pue (114) DSIM 
piojyueis 
pue (a0ueurI0s19g) DSIM 
projueys pue (Teq19A) OSIM 
projueis pue (IN) OSIM 


inquy 


8° 01 yoUulg Pr0juejiS 


1s'tt OT tol 


8b Fl 


OS 86 





(z) sniiv 





97’ bt 08°66 





(aoueUr10j19q) q-M 

pue (soueuliojiag) SIM 
(Teq49A) 

a-M pue (jeqiaA) DSIM 


(M4) €-M pure (My) OSIM 


(aoueu10j13q) 
MAI]IG J2TSYIIM 
(Teq12A) 
BNAIIIG JaTSyYIIAA 
(Ima) 
aNAI]eg JIjsyooA, 


00° 601 
00° Zot 
00°90T 


00°ZTT 


fst 
07 
for 


(97) 
309 8 
21738] 9q 





ysnou 


“poor, pue (TINY) OSIM 


uvW-e-MeIGg 


Z1°$6 *y2nouapoor 


8° LOI 


— 719%S¢ 


(zs) 
qoraueyy 





eL° 
£3" 
43° 


psojyuris 
pue (aoueuti0j}i9q) OSIM 
Projyuers pue (Teq19A) DSIM 
Psojyuejzs pue (1iNY) OSIM 





8h" 
8s° 
zo°+ 


pioyueis 
pue (a0uemri0yiag) OSIM 
projueis pue ([eq19A) OSIM 
psojueys pue ([INY) OSIM 


7L°6 


Il’ ozt yUl_ P10jueIsS 


£L°et LOTTI 


6l' tt 


6£° ¢0! 


a Ze 


(pS) 
pueljoH 





WZ'et 9S" LOT 


— oes 


(ot) 
aaqie) 
2g SB 





Pazepa110D sa] 


sap 


(Or) 
W 


» (OD 
Ww 


a 


(OD) 
Ww 


» OD 








SUOT}LIAIIO7 189] -19zU] 


OSIM Bey? 29490 8189, 


u01993S 
DOUBULIO}IIg 


u01998S 
Teqi2A 


TPS 1M 





("s14) 

PIC (814) 
10 «6aaBury 
Ww 


syafqng 


10783 
-1989Au] 





NAYATIHD AOd SISA], ADNADITIALN] SNOIMVA AO SAIGALS AALLVAVAKOD 
I] ATAVL 





asa ead 


CHILD PSYCHOLOGY 21 


ford Binet yields a higher IQ than the WISC and that the verbal section of 
the latter correlates somewhat better with the Stanford Binet than the per- 
formance section, in fact almost as well as the full scale. The low sigma of the 
Stanford scale at age five is once again revealed (110). 

Triggs & Cartee (110) and Holland (54) disagree on the magnitude of the 
difference between the Stanford Binet and WISC scores in relation to the 
variables of age and size of IQ, Holland contending that there is no relation 
between these latter variables and the magnitude of the difference between 
the two tests. Although he employed some refinements of technique, e.g., 
control of practice effects which was lacking in the others’ study, it is doubt- 
ful whether these are adequate to compensate for the paucity of his subjects 
and their wide irregular scatter. 

De Lattre & Cole (26) found that their 10- to 16-year-old subjects scored 
better on the WISC than on the Wechsler-Bellevue Scale. The correlations 
between parallel subtests ranged from +.19 for picture-arrangements to 
+.71 for digits, a warning against assuming that tests with similar names 
measure the same processes. The verbal section of the WISC correlated bet- 
ter with the Progressive Reading Test than did the California Mental Ma- 
turity Test. 

Hanvik’s (52) emotionally disturbed subjects scored significantly better 
on the WISC than on the Goodenough Draw-a-Man Test, the correlation 
between the two instruments being insignificant. Hanvik thinks the drawing 
test is more anxiety activating. Possibly the children were indifferent to the 
whole test situation but found it more difficult to resist when the experi- 
menter was stimulating them directly and constantly offering new tasks. 

Ansbacher (5) reports that the Draw-a-Man Test correlated +.41 with 
the Thurstone PMA? Test but scarcely at all with the MacQuarrie Mechanical 
Ability Test. His subjects were a group of 100 fourth-graders. Since the sub- 
tests in the PMA battery of space, perception, and reasoning showed the 
most relation to the Drawing Test, and the MacQuarrie Test r’s were very 
low, Ansbacher thinks this is evidence that the Drawing Test is less one of 
motor skills than of cognitive processes. 

There has long been the suspicion that the examiner’s relation with the 
preschool children he is testing determines in a major way the children’s 
performance. Sacks (96) in a neat study tested this assumption. She carefully 
matched three groups on the basis of an initial Stanford Binet test score. 
Then for 10 days she tried to ingratiate herself with the children of one group 
by serving them well when at nursery school and to establish poor relations 
with a second group by being sober, indifferent, and denying help; she 
had no contact with the children of group three. If she was really able to 
control the tester’s bias, her results are significant. On retest the groups 
gained on the average 14.5, 5.0, and 1.6 IQ points, respectively. Apparently 
mere familiarity with the tester had some favorable effect on test perform- \ 
ance; good relations, considerable. 

Darcy (22), after reviewing the relevant literature, concludes that bilin-) 
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guals usually do less well on verbal intelligence tests than monoglots, but do 
not differ in achievement on performance tests. This is confirmed by Johnson 
(61( who worked with bilinguals whose second language was Spanish. John- 
son’s ingenious device for gauging degree of bilingualism should be men- 
tioned, as it is apparently a better index than the Hoffman Test. The index 
is the ratio of the number of words in the first language recalled in five min- 
ures to the number of words in the second language recalled in the same 
interval of time. 

For studies of other variables which influence mental test performance 
see Estes (33); Fischer (35); and Klatskin (66). 

Speech and language.——See Darcy (22); Du Pont (30); Irwin, (58); Klat- 
skin (66); Mowrer (84); and Werner & Kaplan (117). 

Perception, conception, thinking, memory.—While the studies in the area 
named are motley and difficult to relate, they point often to the develop- 
mental trend in thinking from syncretic to differentiated, from concrete to 
abstract, and from egocentric to social. 

Following a Piaget type (89) of analysis of the reasons children gave for 
their responses in a sharing test, Ugurel-Semin (113) confirms the sequence 
he suggests. Following is the developmental sequence Semin noted: ego- 
centric response, knowledge of the moral rule but action still egocentric, 
adult-dictated rule governs conduct, child assesses rule, modifying it in terms 
of his own experiences. 

The development in the understanding of time which occurs between the 
years four and six has been described by Springer (108). 

A contribution which lends some support to the genetic theory of con- 
sonance comes from Bugg & Thompson (14). They noted (a) that students 
who had learned to be appreciative of dissonant music rated a number of 
presented intervals the same in consonance as did a group of students who 
disliked dissonant music and (b) that the rating of the first group did not shift 
as their liking for dissonant music increased. 

The law of primary reinforcement was verified by Siegel & Fashee (102) 
using preschool children, a Skinner lever-pressing device, and candy pellets as 
rewards. The more reward the children received the more difficult was ex- 
tinction of the lever-pressing response. 

There has long been interest in the question of whether childhood really 
is the happy period it seems when viewed from the adult point of vantage. 
No definitive answer is supplied to this question and several related issues 
by the studies of Smith (103). Tuckman & Lorge (111), and Barschak (10). 
Rather they tend merely to confirm the view that adults look upon the period 
of childhood as the halcyon days and think with no relish on getting old. 

Major studies already mentioned which deal with problems of percep- 
tion, conception, and thinking, are as follows: Ames (3); Ames et al. (4); 
Goodman (45); Werner & Kaplan (117). 
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LEARNING"? 


By Epwarp C. TOLMAN AND LEO PosTMAN 
University of California, Berkeley, California 


THEORY 


Books by Hull (74) and Guthrie (62) have brought their respective S-R? 
theories up to date. A brief article by Tolman (165) gives a précis of his 
most recent general formulation. MacCorquodale & Meehl (110) have pre- 
sented an elaboration of an earlier paper (112) in which they formalized ex- 
pectancy theory by a set of postulates. They include reinforcement in addi- 
tion to frequency of presentations, S,-R,—S:, as probably an important 
determiner of expectancies.’ This is the only point in their discussion to which 
the present reviewers would definitely take exception. 

Hull’s last book (74) is of great significance and will long stand as a me- 
morial to his creative genius. It presents in final form his basic postulates 
and corollaries. It also contains many ingenious, though often not very rigor- 
ous, applications of these postulates and corollaries, through the derivation 
of theorems, to most of the factual problems of animal learning. One feature 
of the system as now presented, in which we naturally concur, is his latest 
equation for Reaction Potential, 


on ‘oe 
sEr = DXViX K X sHr 


(where D stands for drive strength, Vi for stimulus-intensity dynamism, K 
for incentive motivation, and gHp for habit strength). This formula makes 
somewhat the same distinction between learning (sHpr)* and performance 
(sEr)* that we would. Two basic differences between Hull’s system and ours 
are, however, to be noted: (a) Hull conceives what is learned to be an gsHr 
whereas we conceive it to be an expectancy, a “‘readiness” for an 5; to be fol- 
lowed by an S: provided a certain type of R; occurs; and (b) Hull still asserts 
the acquisition of an gHp to be a function of frequency of reinforcement 
(though no longer of amount of reinforcement). We would assert the acquisi- 


1 This review is based on publications covering the period from about May 1, 
1952 to May 1, 1953. The writers wish to express their gratitude to Mr. J. Trevor 
Peirce for bibliographical assistance, and to Miss Pauline Austin for help in reading 
the proofs. 

2 The following symbols appear in this chapter. When special meanings are needed 
additional symbols are added to the basic letters. The meanings intended are ex- 
plained by the authors where these first appear. CR (conditioned response); CS 
(conditioned stimulus); D (drive, drive strength); sEr (excitatory strength); sHr 
(habit strength); Jz (reactive inhibition); sZz (conditioned inhibition); M (means); 
pR (pointer-reading response); R (response); rg (fractional anticipatory goal re- 
sponse); rs (central neural surrogate response); S (stimulus); s¢ (fractional goal 
stimulus); US (unconditioned stimulus). 

3 We have gratefully borrowed some features of their notation. 
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tion of an expectancy to be primarily a function of frequency of past occur- 
rences of S:—Ri—S2, whether or not S: has had positive, negative or zero 
incentive value. However, positive or negative incentive values acting as per- 
ceptual emphasizers undoubtedly speed up acquisition. 

Gleitman et al. (53) have attacked Hull’s doctrine of extinction. They 
point out, for example, that insofar as Hull now defines responses not as 
specific muscle contractions but as oriented adjustments to the environment, 
Tr? (conceived as attributable to the building up of specific fatigue-like prod- 
ucts) loses much of its applicability. Further, insofar as extinction is also 
said to be due to the building up of sIr?, progressive increase in the latter 
should eventually overtake the amount of sHpr and cause the response to dis- 
appear permanently. Gleitman et al. also point to the difficulties, as had 
Ritchie (134), of the concept of the ‘‘not-response.”’ 

Seward has continued his bridge between S-R theory and expectancy 
theory by his construct of rs? (cf. 143). The rs performs for Seward much the 
same function that rg? and sg? perform for Hull and Spence. The rs is an as- 
sumed neurological mechanism by which the significate and also its goal 
character, if it should have one, are brought back to the S at the choice point 
and made a determiner of R at that point. In two recent papers (144, 145), 
he has specifically applied this concept of rs’s to the acquisition of motives. 

The new edition (62) of Guthrie’s book presents again his doctrine of 
single-trial S-R contiguity learning. Guthrie also asserts once more his doc- 
trine that the R which gets conditioned is the ‘‘movement” which removes 
the organism from the given S. So-called adjusted, purposive ‘‘acts’’ are for 
him purely fortuitous, happy (or perhaps unhappy) combinations of condi- 
tioned ‘“‘movements.”” It would seem to the reviewers that nature must be 
even cleverer about providing the right environmental stimuli for combining 
movements into adjusted, directed acts than we would have supposed her 
capable of. 

There have also appeared several discussions concerning the legitimate 
objectives of learning theory. Kendler (81) argues that a behavioral theory 
can as such merely seek for operationally specifiable intervening variables. 
Insofar as the theorist gives substantive meanings to these intervening vari- 
ables (i.e., asserts hypothetical constructs) he is going in for reifications which 
are not legitimate since they have no truth value. These constructs may, 
however, be helpful to the particular theorist, given his private style of think- 
ing. Ritchie (135) in an oblique and witty reply points out that, if geogra- 
phers had held to this same ‘‘modern methodology,’”’ they would not have 
found it permissible to ask whether the earth were round or flat! But, if 
Kendler be right, they should have arrived at the same rules for navigation. 
Ritchie’s moral appears to be that if one assumes, for example, the model 
properties of S-R connections one really will arrive at different predictions for 
behavior than if one assumes the model properties of expectancies. Davis (32), 
at odds with all theoretical constructs at the merely behavioral level, suggests 
that what we need is a “physical psychology” in which stimuli, responses, 
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and intervening processes would all be defined by their physical and physico- 
chemical characteristics. Kessen & Kimble (83) in criticizing Krech’s neuro- 
logical ‘‘dynamic systems” (90, 91) argue valiantly for the legitimacy of any 
sorts of theoretical constructs be they drawn by analogy from neurology, 
physics, sociology, aesthetics, or whatever. 

Our classification of the empirical studies reflects the fact that our sys- 
tematic approach leads us to employ four types of constructs in the analysis 
of stimulus-response associations, viz: discrimination, expectancy, drives 
and incentives, and performance. In line with this conceptual analysis we 
shall distribute our summaries of the empirical studies under the headings: 
(a) discrimination learning; (b) acquisition of expectancies in classical and 
instrumental conditioning; (c) performance in classical and instrumental con- 
ditioning, especially as determined by drive conditions; and, cutting across 
these categories, (d) verbal learning, and (e) motor learning. 


DISCRIMINATION LEARNING 


The process of discrimination is central to an expectancy theory of learn- 
ing. In such a theory, the organism is conceived to discriminate and catego- 
rize environmental stimuli as signs, means, and significates. Our analysis of 
discrimination learning will focus on two major problems: (a) conditions 
governing perceptual discrimination per se; (b) the ways in which the dis- 
criminated features of the environment come to function as signs. 


CONDITIONS OF DISCRIMINATION 


When differential responses are systematically related to an orderly series 
of stimuli (such as black, gray, and white), a dimension of discrimination has 
been established. In experiments on discrimination learning, stimuli located 
at different points along such dimensions come to serve as differential signs 
to positive and/or negative significates. The more firmly the relevant dimen- 
sions of discrimination have been established, the more readily can expectan- 
cies be learned. A number of studies help to throw light on the process of per- 
ceptual learning in general as well as on the acquisition of specific dimensions 
of discrimination. 

Early learning —Hebb’s (69) theory of perception, with its emphasis on 
the importance of early perceptual learning, has stimulated a number of 
empirical studies. Forgays & Forgays (47) and Hymovitch (76) studied the 
effects of varied infant environments on problem-solving by the adult rat. 
Both found that early opportunity for perceptual experience may have last- 
ing beneficial effects. Partial support for this conclusion also comes from a 
study of Bingham & Griffiths (15). 

The influence of early training on discrimination learning was studied by 
Siegel (149) who showed that ring doves reared under conditions of visual 
deprivation were significantly retarded in the later solution of a discrimina- 
tion problem. According to Griffiths & Stringer (59), variations in the inten- 
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sity of early stimulation were not, however, found to produce significant ef- 
fects on adult learning in rats. 

Perceptual training experiments——Hypotheses about perceptual learning 
have also been tested in experiments investigating the effects of training on 
the discrimination of specific classes of stimuli. Lawrence, who had previously 
presented important evidence for the acquired distinctiveness of cues (98, 
99), used a transfer situation to study the conditions governing the establish- 
ment of a new dimension of discrimination in rats (100). A difficult discrim- 
ination is established more effectively if it is preceded by training on an easy 
discrimination along the same dimension than if all training is given directly 
on the difficult stimulus pairs. A gradual change from an easy to a difficult 
discrimination is optimal for the establishment of the dimension. We would 
certainly agree with Lawrence’s conclusion that increase in sHpr is not the 
only factor in discrimination learning, and that the animal must learn ‘‘to 
isolate functionally the relevant stimulus dimension from other background 
and irrelevant cues” (100, p. 516). 

Several experiments have demonstrated the effects of training on per- 
ceptual discrimination in human subjects. Smith (152) was able to produce 
increases in the apparent size of geometric forms by giving his subjects con- 
trolled experience with shapes of large physical size. The effects of the ex- 
perience were specific to the forms used in the training. Bevan & Zener (12), 
investigating the effects of practice on the perceptual thresholds for geome- 
tric designs, obtained general practice effects in recognition as well as in- 
creased sensitivity to the specific figures practiced. Atkinson & Ammons (8) 
found that latency of recognition decreases with practice on a given stimu- 
lus but failed to find evidence for a perceptual ‘‘learning to learn.” Closely 
related in its implications to the studies just reviewed is an investigation of 
Forgays (46) which showed that (a) tachistoscopic recognition thresholds for 
words decrease with educational experience and (b) superior sensitivity to 
words presented to the right of fixation, as compared with words presented 
to the left of fixation, is similarly a function of educational level. These re- 
sults, in agreement with previous findings from Hebb’s laboratory (116, 
125), point to the learned nature of highly specific visual discriminations. 

The learning of a discrimination may often depend on the performance 
of an orientating response or “‘receptor-exposure act.”’ The beneficial effects 
of.training on discrimination may be due partly to the establishment of such 
orienting responses. Wyckoff (183) presents an elaborate theoretical dis- 
cussion of the conditions governing the acquisition of observing responses in 
terms of reinforcement theory. He proposes that the discriminative stimuli 
come to function as secondary reinforcers for the observing response once 
the subject has learned to respond differentially to the discriminative stimuli. 
The author recognizes the danger of circularity in this argument and at- 
tempts to escape from it by a series of quantitative assumptions concerning 
the effects of primary and secondary reinforcement. The reviewers could not 
help but feel that his highly elaborate analysis represents a tenuous ad hoc use 
of the principle of secondary reinforcement. 
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Differential effectiveness of discrimination stimuli.—In analyzing response 
to a given stimulus unit, we must take account of the organism's discrimina- 
tion system. By discrimination system we refer to the organisms’ propensi- 
ties to categorize and order stimuli along dimensions of discrimination. The 
discrimination system is specified in terms of the psychophysical functions 
characteristic of the organism. Several studies have appeared during the year 
which contribute to the specification of the discrimination systems of several 
species of laboratory animals. 

Amsel (5), using rats, studied the rate of acquisition of a brightness dis- 
crimination as a function of the duration of the discrimination stimuli which 
consisted of black and white runways preceding a choice point. He found 
speed of learning a direct function of stimulus duration. A previous study by 
Eninger (37), using auditory discrimination stimuli had, on the other hand, 
found the shorter durations more effective. Another study by Eninger (38) 
compared the effectiveness of compound discrimination stimuli involving 
both visual and auditory sense modalities with stimuli involving only the 
one or the other of these modalities. The compound stimuli were more ef- 
fective and visual cues resulted in faster learning than did auditory ones. 
Greater effectiveness of visual stimulus components, as compared to audi- 
tory and tactual ones, was also reported by Fink & Patton (42). The relative 
effectiveness of color and form in the discrimination learning of monkeys was 
compared by Cole (30). In tests pitting color against form he found color to 
be the dominant cue; further tests showed, however, that form discrimina- 
tions may be learned simultaneously with the dominant color discrimina- 
tions. Further evidence for the simultaneous establishment of discrimination 
along two dimensions (size and brightness) is presented by Grice (57) in a 
study of visual discrimination learning in rats. 


THE DISCRIMINATION OF SIGNS 


What are the ways in which the discriminated features of the environ- 
ment come to function as signs? From the point of view of the present anal- 
ysis the two stimuli in a discrimination setup come to function as differ- 
ential signs, one pointing to a positive, the other to a negative significate. 
Hence, in an expectancy theory, the problem of the nature and temporal 
development of the differentiation between the positive and negative signs 
is an important issue. 

The problem of patterning.—A significant contribution to this problem 
has been made by Bitterman and his associates in a series of papers which 
began some three years ago and of which several appeared during the current 
year (16, 161, 167, 182). The general thesis of these papers is that a distinc- 
tion must be made between two nonadditive processes of perceptual organ- 
ization in discrimination learning: (a) configurational learning which involves 
a ‘primitive, diffuse” perception of the situation, and (d) relational learning 
in which the organism comes to respond selectively to the abstracted sepa- 
rate sign features in the situation. This latter process of relational learning 
corresponds to Lashley’s (96, 97) analysis of the discrimination process 
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whereas configurational learning conforms to the description of Gulliksen & 
Wolfle (61). Using this two-factor theory, Bitterman and associates have 
been able to account for their own experimental findings which conform to 
neither Lashley’s nor Spence’s (153, 154) theories. Notable among the em- 
pirical findings was the speedier acquisition of a successive as compared with 
a simultaneous discrimination (179). This finding was not, however, con- 
firmed by Spence (155). It may be mentioned in this connection that Loess 
& Duncan (103), comparing successive and simultaneous discrimination 
learning in humans, found (a) no difference between the two methods in 
learning an easy discrimination, and (b) simultaneous presentation superior 
in the learning of a difficult discrimination. 

Spence (155) has criticized Bitterman’s analysis, at the same time extend- 
ing his own theoretical views so as to encompass the case of successive dis- 
crimination. Compounds such as ‘‘white-left” and ‘‘black-right’’ may acquire 
excitatory potential for the response of approaching. Such ‘‘transverse pat- 
terning”’ will, however, occur only under conditions which do not allow learn- 
ing on the basis of simpler cue-position patterns. Bitterman (16), in a reply 
to Spence, summarizes the evidence for a two-factor theory and objects to 
the ‘‘theoretical and experimental constriction’”’ of Spence’s position. 

The implications of Spence’s theory of discrimination learning for trans- 
position behavior were tested by Hunter (75) in a series of experiments on 
size discrimination in children. His results uniformly favor interpretation in 
terms of relational learning rather than reinforcement of absolute cue proper- 
ties. On the other hand, a study by French (48), showing the dependence of 
discrimination learning in humans on the number of common stimulus ele- 
ments receiving consistent reinforcement, is in agreement with an interpre- 
tation in terms of stimulus generalization. In this connection, mention should 
be made of an extension of Spence’s analysis to multiple-choice learning in a 
paper by Cook (31). 

Discrimination reversal—Studies of discrimination reversal have fre- 
quently been viewed as ‘‘crucial experiments”’ for testing divergent interpre- 
tations of discrimination learning (as, for example, in the continuity-dis- 
continuity controversy). A modified continuity position is presented by 
Gatling (50) in his investigation of the effects of repeated stimulus reversals. 
His results show progressive decreases in negative transfer effects as rever- 
sals are continued. He ascribes the increased efficiency of learning to the 
simultaneous operation of two factors: (a) a continuing build-up of habit 
strength which is transferred from reversal to reversal; and (b) the mastery 
of the reversal problem per se, i.e., essentially the acquisition of a reversal- 
learning set. Several studies of discrimination reversal in human subjects 
appeared during this year. In a series of three papers, Buss (22, 23, 24) ad- 
dresses himself to the problem of “rigidity” in discrimination-reversal learn- 
ing, i.e., to the conditions determining readiness to shift from old to new dis- 
criminations. He found (a) differential and partial reinforcement during orig- 
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inal training more favorable to reversal than continuous reinforcement; 
(b) the ability to shift depends on the absolute values of the cues reinforced 
during training rather than on the relation among the cues. An interpretation 
of “‘rigidity’’ as learned behavior is also proposed by Schroder & Rotter (141) 
who show that the propensity to shift from one solution to another in a card- 
sorting problem depends on learned expectancies of alternative solutions. 
Results favoring the continuity hypothesis in human discrimination learning 
are reported by Walk (176). In two experiments reversal of the cues used in 
the preliminary training slowed down learning of a visual discrimination al- 
though the effects were of limited duration. Walk states that the positive 
stimulus appeared to have greater weight in determining discrimination than 
did the negative stimulus. It should be noted in this connection, however, 
that in a study of visual discrimination in rats Fitzwater (43) found prelim- 
inary training, which included experience with the positive and negative 
stimuli presented separately, more favorable than reinforced practice on the 
positive stimulus alone. 

Discrimination-learning sets—The continued confirmation of a series of 
related expectancies may give rise to generalized expectancies (‘‘beliefs’’) em- 
bodying the common features of the specific expectancies. It is in terms such 
as these that we would conceptualize the learning sets demonstrated by 
Harlow (64). At the same time, we would point out that the phenomenon of 
learning sets underscores the inadequacy, or at least the limitations, of an 
analysis of discrimination learning based on the reinforcement of absolute 
cue values. Several experimental analyses of discrimination-learning sets ap- 
peared during the year. Harlow & Warren (66) report the establishment of 
learning sets in a series of 450 discrimination problems using planometric 
stimulus objects (the previously reported studies had used stereometric ob- 
jects). Although learning sets were again demonstrated, the inter-problem 
improvement tended to be less than in the earlier studies, a result which is 
attributed to the greater similarity among the stimuli used in the new ex- 
periments. Riopelle (133) presents an experimental analysis of the acquisi- 
tion of learning sets in monkeys for the solution of new and reversed dis- 
crimination problems. Initial disparities between the learning rates for the 
two types of problems gradually disappeared as the animals gained expe- 
rience. Riopelle suggests that the acquisition of a learning set involves ‘‘trans- 
fer suppression,”’ i.e., performances on successive problems in general, and 
reversal problems in particular, become increasingly independent of each 
other as the training continues. The course of the acquisition of discrimina- 
tion-learning sets in chimpanzees and in children was studied by Hayes and 
associates (67). Their results point to qualitatively different stages in the 
acquisition of discrimination-learning sets: an early stage in which abstract 
principles are learned and a later stage characterized by the improvement of 
performance skills and work habits. The data fail to yield evidence for inter- 
species differences in the ability to acquire sets. In a further paper, Hayes 
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et al. (68) investigate chimpanzees’ ability to learn various numbers of dis- 
crimination problems simultaneously. A high degree of accuracy can be ob- 
tained with as many as 20 simultaneous problems. 

We cannot leave the problem of discrimination learning without noting 
Harlow’s (65) eloquent insistence that the motivation of his primate sub- 
jects in discrimination experiments is not to reduce visceral needs but to sat- 
isfy their curiosity and to manipulate environmental objects. Working in 
Harlow’s laboratory, Butler (25) has presented convincing evidence for the 
strength and persistence of a visual-exploratory motive which is not derived 
from other biological drives. 


ACQUISITION OF EXPECTANCIES IN CLASSICAL AND 
INSTRUMENTAL CONDITIONING 


Before turning to the actual studies a brief expansion of our theoretical 
analysis will be helpful. The basic independent variable governing the acqui- 
sition of an expectancy is, according to us, one or more occurrences of a 
“stimulus-response-stimulus’”’ sequence. We shall symbolize such an event 
by {(S:, M):Ri—-S:2}. This is to be read: an initial stimulus S, is presented 
and with it an associated ‘‘behavioral support” (see Tolman, 164, p. 96f) or 
“‘means,’’ M, and, given this S; and this M,? a response-act, Ri, occurs and 
results in the stimulus S2. Moreover, since such a sequence usually occurs 
within some drive context and may take place more than once, and since S, 
may have either positive, negative, or zero incentive value, the complete 
symbol will be DD{(S,, M):R:—S.,*} where D stands for an active drive or 
drives; > for number of occurrences; and (*) for the positive, negative or 
zero incentive value of S:. Further, the consequence of such DY {(Si, M): 
Ri—S2*} occurrences is conceived by us to be the acquisition of an expect- 
ancy, symbolized as (S,:.R:S2*). Such an expectancy is to be read: if a ‘“dis- 
criminated” stimulus of type 5; be present, and if the ‘‘discriminated”’ re- 
sponse-act of type R; be made, a “discriminated” stimulus of type S2 with 
“‘discriminated”’ incentive value (*) will appear. Given such an expectancy, 
if there be a drive D relative'to which S;,* is either positive or negative, then 
upon the presentation of S,, there will occur a disposition to ‘‘go to,” ‘‘es- 
cape from,” ‘‘avoid,”’ or ‘‘merely wait for’’ S.*. And this disposition will be- 
tray itself by a specific performance or pointer reading response pR (e.g., 
salivating, sweating, running, jumping, etc.) which is what the experimenter 
observes and measures. 

In Table I are listed eight independent variables the consideration of 
which is suggested by the above analysis. These provide us with the cate- 
gories under which we shall order the empirical studies. 

Type or intensity of drive, D—(a) Noxious light as D: Flynn & Jerome 
(45) were able to teach rats multiple-choice-box problems where D was the 
drive to avoid strong light and S,* a dark goal compartment. Caldwell & 
Womack (27) found also that albino mice could learn a maze to get out of 
light. (b) Intensity of relevant D: Teel (160) found, using rats, that rates for 
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TABLE I 
INDEPENDENT VARIABLES GOVERNING THE ACQUISITION OF EXPECTANCIES 








1. Type or intensity of D2 

2. Type or intensity of S;. 

3. Features of M. 

4. Quality and incentive value of S.* (Type and amount of reinforcement). 

5. Frequency and probability with which S,* follows R; (Frequency and pattern 
of reinforcement). 

6. Time delays between R,; and S,* (Delay of reinforcement). 

7. Intertrial intervals. 

8. Derived incentive value of S,* (Secondary reinforcement). 





both learning and extinction did not differ for deprivation periods of 1, 7, 15, 
and 22 hr. of hunger. Apparently strength of D over a normal range does not 
affect acquisition though it may affect performance. (c) The effect of an ir- 
relevant D on conditioning: The effect of an irrelevant drive upon the estab- 
lishment of an association is illustrated in an experiment by Bahrick (9). 
This investigator subjected rats to sensory preconditioning with light and 
buzzer under two degrees of food deprivation. He then conditioned the rats 
to buzzer as sign for shock. Subsequently, he trained them on the light as 
sign for shock. The previous pairing of light and buzzer under high depriva- 
tion led to more rapid learning to avoid the light. Spence & Taylor (158) and 
Spence & Farber (157), using eyelid conditioning, found that anxious sub- 
jects, as measured on items from the Minnesota Multiphasic Personality 
Inventory, condition more readily and extinguish more slowly than non- 
anxious subjects. In such a situation, anxiety may be considered an irrele- 
vant drive. Bitterman & Holtzman (17), using the galvanic skin response 
(GSR) also found that anxious subjects selected by a number of clinical 
measures, conditioned more rapidly and extinguished more slowly than non- 
anxious subjects. (¢d) Effect of anxiety on trial and error: Farber & Spence 
(40) found with a stylus maze that anxious subjects did more poorly, es- 
pecially on difficult choice points. Where difficult discriminations have to be 
made between S;’s leading to different S,*’s (blind ends versus true-path 
segments) anxiety may interfere with the building up of the appropriate 
expectancies or, as the authors suggest, tends to enhance indiscriminately all 
the response tendencies present. 

Type of intensity of initial stimulus, S;—The readiness with which an ex- 
pectancy is acquired may depend on the quality or intensity of the initial 
stimulus. It is to be noted, however, that we have already discussed most of 
the experimental evidence illustrating this relationship in the section above 
on conditions of discrimination. 

Features of behavior support, M.—We shall consider here a group of ex- 
periments in which the environmental arrangements supporting the animal’s 
response appear to be an important variable. (a) Time intervals in condition- 
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ing (that is, environmental supports for waiting): White & Schlosberg (180) 
conditioned a GSR with light as S; and shock as S,* with different intervals 
(M’s) requiring different R,'s of waiting between S, and S,*. They found that 
an interval of 0.5 sec. resulted in the fastest conditioning. They also note that 
the optimal time intervals for autonomic and skeletal conditioning are about 
the same. Fitzwater & Reisman (44) compared forward, backward, and si- 
multaneous presentations of S; and S,* in instrumental finger withdrawal. 
The forward setup where S; preceded S;* by 0.5 sec. here also produced the 
clearest evidence of conditioning. Kaplan (80), using intermittent reinforce- 
ments, taught rats to push a lever where the resultant S,* was the going off, 
for 1.1 min., of a noxious light. For different groups the intervals between 
reinforcements were 0.2, 0.5, 1.0, 2.0, 3.0, or 5 min. The longer the time in- 
tervals the slower the rate of responding. That is, the longer the average time 
between the R,’s in any interval and the S,* at the end of that interval the 
less readily was the expectancy formed. (b) Supports for space and response 
learning: Hill & Thune (70) ran two groups of rats on single-choice elevated 
mazes. Their findings, in contrast to the earlier one of Tolman, Ritchie & 
Kalish (166), was that a ‘‘turning’’ R; was acquired more readily than a “‘go- 
ing towards a given place’’ R;. However, their maze-path arrangements 
(i.e., M) were different from those of Tolman et al. and seem to us to have 
favored response learning since the response group always made a right- 
angle turn whereas the place group had to make both right angle and obtuse 
turns. Moreover, the goal place did not occupy as distinctive a position with 
respect to the source of light. (c) Introduction of new M: Page & Hall (126) 
extinguished a conditioned avoidance response by restraining the rats (intro- 
ducing a new M) for a brief interval after the coming of the CS? in the for- 
mer shock compartment for five trials, and then testing in a series of ordi- 
nary extinction trials. These rats extinguished more rapidly than a control 
group not given this first ‘‘restraint’’ experience. 

Quality and incentive value of the terminal stimulus, S2* (Type and amount 
of reinforcement).—Furchtgott & Rubin (49) ran rats, all 22 hours hungry, in 
a two-unit maze to different amounts of food: 2500 mg., 250 mg., 75 mg., 20 
mg., and 0 mg. No differences in the learning curves were obtained save for 
the 0 mg. group which showed little manifest learning. We would suspect, 
however, that, if food had then been introduced, the 0 mg. group would have 
shown that they had achieved “‘latently” the same amount of actual learn- 
ing. Bitterman, Reed & Krauskopf (18) compared the conditioning of GSR 
to light with a US (shock) of 3 sec. duration (i.e., strong negative reinforce- 
ment) and a US? (shock) of only 0.5 sec. duration (i.e., weak negative rein- 
forcement). No differences were found. On the other hand, Spence (156) con- 
ditioned eyelid responses under two different strengths of US (air puff). On 
the next day half of each group of subjects was tested under strong and half 
under weak air puff. The number of CR’s? was significantly more for the 
strong than for the weak US both in training and in the test. Again the ques- 
tion remains open as to whether the strength of the negative unconditioned 
stimulus affected learning or performance, 
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A study by Seeman & Williams (142) is concerned with the ‘‘Hull- 
Leeper’’ problem of the conditions under which different expectancies can be 
attached to different drive states. The animal must learn to discriminate 
between two drives as signs to different S,*’s. These experimenters taught 
rats, when hungry, to run around the left arm of a maze to the goal box and 
food and, when thirsty, to run around the right arm of the maze to the same 
goal box and water. For the experimental group the final common path and 
the goal box were black for food and white for water. For the control group, 
they were black for both food and water. The former group learned faster. 
The two more distinct S,*’s correlated with the two different drives favored 
acquisition of the two expectancies. 

In general it is our view that it is frequency of stimulus-response-stimulus 
occurrences and not amounts of reinforcement which primarily affect ac- 
quisition. We believe, in contrast to Hull, that learning may occur in the ab- 
sence of reinforcement. This last hypothesis is of course tested in experi- 
ments on latent learning to which we turn next. 

Latent learning.—Interest in this problem continues unabated. The 
question no longer is whether latent learning does or does not occur. Rather 
a number of specific experimental variables are being explored in the hope of 
specifying the conditions of latent learning more and more precisely. (a) 
Spatial position of irrelevant incentive: Thistlethwaite (163) and McAllister 
(108) both investigated the effect on latent learning of the spatial position of 
food (irrelevant incentive) relative to the water when preliminary training 
was under thirst, and water was on both sides. Both investigators found that, 
when the food was placed close to the water, the rats did not show latent 
learning, but when the food was at some distance prior to the water they 
did show it. (6) Degree of motivation for relevant incentive: Johnson (79), 
using a T-maze with food in one goal box but no water in either arm, ran 
rats with different degrees of thirst. He found that the greater this original 
thirst, the greater the number of trials to learn to run to the food side when 
the rats were made hungry. This finding suggests that the more thirsty ani- 
mals were perceptually more set for water (even though it was not there) 
and hence less ready to perceive the food. (c) Drive discrimination: Thistle- 
thwaite (163) also tested the effect of preceding practice in drive discrimina- 
tion. Two thirds of his rats were given preliminary practice in going to the 
food end of a straightaway when hungry and to the water end when thirsty. 
It was only these animals which showed latent learning. These results con- 
firm the earlier findings of Christie (29). All such findings suggest that when 
in latent learning experiments the animal is switched to a new drive he must 
discriminate this new drive as well as remember where the appropriate goal 
S:* has been if he is to perform correctly. (d) General maze-wiseness versus 
learning of true path: Muenzinger & Conrad (119), after giving their rats 
one, two, or three days of nonrewarded training in a six-unit symmetrical 
maze, reversed the path and tested under reward. Prior to introduction of 
reward, errors had been dropping out progressively. However, when the 
true path was reversed there was negative transfer only for the three-day 
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group. The authors therefore distinguished between a general maze-wiseness 
acquired in the first two days, and a specific latent learning of the true path 
acquired later. (e) Place of prefeeding: Meehl & MacCorquodale (113) ran 
hungry unrewarded rats in the Blodgett maze for five days. Then the ani- 
mals were prefed in a maze-like section placed in front of the starting box. 
When put back in the maze they exhibited a further drop in errors. The 
authors ascribe the effects of this prefeeding to a conditioning of the hunger 
drive to the maze environment. This conditioned hunger drive adds to the 
total drive strength which multiplies with habit strength to produce sEr. 
It seems to the reviewers a surprising assumption that at 22% hr. of depriva- 
tion one prefeeding experience would add a substantial increment to the 
effective drive. (f) Effectiveness of unrewarded preliminary exploration: 
Gilchrist (52) gave different amounts of preliminary exploration of a T-maze 
without food and then trained with food. The more the preliminary explora- 
tion the faster the subsequent learning. Control groups exposed to similar 
amounts of previous exploration but with food showed no faster subsequent 
learning. Again it appears that it is the frequency of presentation rather than 
presence of positive reinforcement as such which determines acquisition. 
Frequency and probability with which terminal stimulus, S2*, follows the 
response R, (Frequency and pattern of reinforcement)—(a) Frequency: The 
influence of frequency per se is by now so well established that there appears 
to have been only one study during the year dealing with this variable alone. 
Siegel & Foshee (150), training children to press a bar for a candy reward, 
found that greater frequency in the original training caused greater resist- 
ance to extinction. (b) Probability of reward (reinforcement): A number of 
studies have been concerned with speed of acquisition and with resistance to 
extinction as determined by various schedules of partial reinforcement. 
Grant & Schipper (55) used 25 per cent, 50 per cent, 75 per cent, and 100 
per cent reinforcements in eyelid conditioning. In general the greater the 
percentage of reinforcement, the faster the conditioning. They also found 
that the 50 per cent and 75 per cent groups showed greater resistance to ex- 
tinction. Lewis (101), using a gambling setup with boys, also found that 50 
per cent and 60 per cent groups extinguished more slowly than a 100 per cent 
group. However, Rubin (140), using a panel-pushing response with rats with 
distributed practice (40 min. intertrial intervals), found greater resistance to 
extinction for the continuously reinforced group. Grant, Schipper & Ross 
(56), with eyelid conditioning, did not, however, find that intertrial spacing 
reduced resistance to extinction following partial reinforcement. They sug- 
gest that the presence of human verbalization may account for the difference 
between their results and those of Sheffield’s (148). In two experiments Bit- 
terman and associates compared 50 per cent reinforcement when presented 
randomly and when presented alternately. With both humans (106) and 
rats (168) the alternate group extinguished faster. It appears to the review- 
ers that the results of the partial reinforcement experiments continue to pose 
a serious problem for S-R reinforcement theory. It is equally true that ex- 
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pectancy theory has not yet been elaborated to a point where it can give a 
consistent interpretation of this total body of data. The results do point to 
the conclusion, however, that organisms can learn probabilities, i.e., that 
probability of reward is one of the environmental features about which ex- 
pectancies are formed. 

Time delays between response, Ri, and terminal stimulus, S,* (Delay of re- 
inforcement).—(a) Unfilled versus filled delay: Ferster (41) trained pigeons 
in a Skinner box with aperiodic partial reinforcement (average every 1 min.). 
He then introduced a 60-sec. delay (the box went dark) between successful 
peck and reinforcement; rate of pecking decreased. However, if the length 
of delay was increased gradually from 0 to 60 sec., the original pecking rate 
could be maintained. Each bird then filled in the delay with its own ‘“‘super- 
stitious’”’ response. In another experiment birds learned to keep up the rate 
of pecking to one color of the target key which did not give food but which 
did (average every 1 min.) produce a second color of the target key in an ar- 
rangement by which this second color was followed regularly and automati- 
cally (after 1 min.) by food. The birds continued to peck but more slowly dur- 
ing this delay period. The pigeon apparently cannot learn to expect a de- 
layed goal unless during the delay it can develop some sort of a bridging re- 
sponse (R,) (superstitious or otherwise). (b) Long versus short delays: 
Logan (104) used a double-bar Skinner box. One bar delivered a pellet after 
1 sec., the other after 5 sec. The short-delay bar was responded to faster and 
was preferred. Lambert & Solomon (94), after teaching rats to run a straight 
alley (with a number of jumps in it) to food, tested extinction by putting a 
block near the start, near the goal, or merely by removing food. Block near 
the start caused most rapid extinction, block near the goal the next most 
rapid, and no food produced the slowest extinction. 

Intertrial intervals —Stanley (159) extinguished a learned turning re- 
sponse in a single T-maze under massed (every 15 sec.) and under distrib- 
uted (every 15 min.) extinction trials. There was faster extinction for the 
distributed trials. He also tested for extinction under satiation. The rats 
arrived equally soon at a 50 per cent choice whether their trials were massed 
or spaced. Baron (10) found with delayed eyelid conditioning no differences 
in original learning between a mean intertrial interval of 270 sec. and one of 
120 sec. Vandermeer & Amsel (175) also found with eyelid conditioning no 
difference between spaced and massed trials. Teichner (162), teaching rats 
to depress a food tray in response to a signal, did find distributed trials bet- 
ter. Holland (72), teaching rats to choose between two levers, also found dis- 
tributed trials better. As in other areas of investigation (see below under ver- 
bal and motor learning) the effects of distribution on acquisition and extinc- 
tion seem to defy any simple generalization and to depend upon specific ex- 
perimental conditions. 

Derived incentive value of terminal stimulus, S,* (Secondary reinforcement). 
—Although there is by now a considerable body of experimental evidence in 
support of so-called secondary reinforcement, the nature of the “‘secondary 
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reinforcers” is still moot. Even from the point of view of a reinforcement 
theory the distinction between primary and secondary reinforcer has be- 
come difficult. The presence or absence of biological need reduction has failed 
as the criterion. From our point of view environmental features can acquire 
incentive value and thus serve as way stations (subgoals) to the attainment 
of already existing goals. To what extent such subgoals can become goals in 
their own right seems to us to be an unresolved experimental question. 

McClelland & McGown (109) found that, if there was an irregular and 
unpredictable association between a circular goal alley and when and how 
the rats were fed in it, this goal alley acquired more ‘‘secondary reinforce- 
ment”’ (i.e., subgoal-incentive value) than it did for rats which had always 
been fed regularly at one point in it. Bugelski, Coyer & Rogers (21) found 
no evidence that pre-nonfeeding in a goal box (after learning) caused ex- 
tinction of a simple runway response. This was a repeat, with additional con- 
trols, of the earlier Seward & Levy experiment (146) which had seemed to give 
evidence of secondary nonreinforcement. 


PERFORMANCE IN CLASSICAL AND INSTRUMENTAL 
CONDITIONING 


If there is an expectancy ($,:.R,S2*), and if S,* has either positive or nega- 
tive incentive value, the organism will have a disposition to get to, or to get 
away from, S;*. The strength of the disposition is a joint function of the 
strength of the expectancy, the incentive value of S,* and the drive condi- 
tion. Specific performances serve as pointer-reading responses, pR’s, for in- 
ferring these dispositions to get to, or get away from, S,*. In short, we are 
maintaining the distinction between learning and performance suggested by 
Tolman (164) and Lashley (95), and more recently, reflected in the distinc- 
tion between sHpr and gEp. 

The studies now to be reviewed are ones which we interpret as throwing 
light on the ways in which expectancy, drive, and incentive value interact. 
Once an expectancy can be assumed to have been established, i.e., learning 
to be complete, the strength of the organism’s dispositions to respond can be 
related to (a) type or intensity of D at the time of testing or (b) difference 
between training and test conditions with respect to Si, M, and S,* (general- 
ization with respect to sign, means, and significate). In this section we shall 
attempt an interpretation of the experimental data as regards these deter- 
minants of performance. 


DRIVE AND PERFORMANCE 


Intensity of drive—Hillman, Hunter & Kimble (71) trained rats under 
two degrees of thirst in a 14-unit maze. No differences in error curves ap- 
peared, but the stronger deprivation group ran faster. A third and a fourth 
group were started on weak and strong deprivation respectively and then 
each shifted to the other deprivation. The error curves were not different 
from those of the first two groups. But the shifts from weak to strong motiva- 
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tion and vice versa caused respectively faster running and slower running. 
Deprivation affected performance but not acquisition. Variation in the 
strength of an irrelevant drive at the time of testing may also affect perform- 
ance. Brandauer (19) trained rats in a Skinner-type box under one degree of 
hunger. He then divided his subjects into three groups and extinguished 
under three degrees of thirst. The greater the thirst the more persistent the 
bar pressing performance. These results supported Webb’s earlier findings 
(178). Kendler e¢ al. (82) taught rats to make a given turn in a T-maze to 
obtain food pellets. The rats were then subdivided into two groups, one 22 
hr. thirsty and the other 3 hr. thirsty. When tested under these thirst condi- 
tions, significantly more of the 22 hr. thirsty group turned away from the 
food side. The greater the thirst the greater the negative incentive value of 
the expected food and the less the performance of going to it. (Kendler’s own 
interpretation is in terms of theincompatibility of anticipatory goal responses). 

Kohn (89) and Berkun, Kessen & Miller (11) have reported two impor- 
tant related studies in which plastic fistulas were sewn into rats’ stomachs. 
Using rate of drinking milk (11) or rate of panel pushing for milk (89), they 
found that, in general, prefeeding with milk through the mouth was the 
most effective, injection of the same amount of milk directly into the stomach 
next most effective, while injection of the same volume of isotonic saline 
solution was least effective in reducing further drinking or panel pushing. In 
our interpretation of these results these three types of prefeeding reduced 
the drive to different degrees, and these resultant differences in drive inten- 
sity correspondingly influenced performance. (Ina further study in this series 
Miller & Kessen (115) showed that milk injected into the stomach served as 
adequate reward for the learning of a T-maze even though it was less effective 
than milk taken by mouth.) Kaplan (80), in the study already mentioned 
above in which the rats were trained to turn off a noxious light, also tested 
the effects of lights of different intensities, 0 mL. to 2400 mL. Rate of re- 
sponding increased rapidly from 0 mL. to about 500 mL. but decreased some- 
what with lights from 500 mL. to 2400 mL. A too intense light caused less 
performance. Apparently too strong a drive stimulus had a depressant effect. 
Campbell & Kraeling (28) ran rats on an electrified straightaway in which 
the voltage in the start and middle sections was high and in the goal section 
lower. They found that the greater the drop from the same initial voltage the 
greater the running speed. Secondly, if the goal-box voltage was zero, all 
groups performed with the same high speed, irrespective of the starting-sec- 
tion voltage. Thirdly, a constant decrease in voltage between starting sec- 
tion and goal section had less effect when both start and goal voltages were 
high than when both were low. That is, the effect on performance of one and 
the same drop was greater as between two low voltages than as between two 
high voltages. (A Weber phenomenon?) 

Type of drive-—Amsel & Roussel (6) taught hungry rats to run a straight- 
away to food in two successive goal boxes. After the running speeds had 
stabilized the rats were then “‘frustrated”’ in the first goal box on half their 
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trials by not finding food in it. They definitely ran faster and showed lower 
latencies in leaving this first box on the frustration trials. The authors con- 
clude that frustration in the first box was a new added drive which enhanced 
the performance of going to the second goal box. Zeeman & Radner (184) 
taught rats to cross the center of a tilting box to turn off light but could not 
teach them to cross to turn on light. It would seem plausible to us to assume 
that both expectancies concerning the appearance or nonappearance of the 
light were equally well established. But only the drive to get away from light 
produced performance. 

It should be noted that some of the “‘latent-learning”’ studies summarized 
above might also have been considered with this group of studies. For in 
many of them the final test runs consisted in introducing a new D, appropri- 
ate to the previously irrelevant goal object, and when the new D was intro- 
duced the incentive value of the irrelevant goal became positive and the per- 
formance of going to it appeared. 


GENERALIZATION 


We may turn now to studies of generalization. In terms of our analysis 
such studies investigate the effect of differences between training and test 
conditions as regards S;, M, and S2*. 

Generalization with respect to S,;—These are studies which have been 
treated conventionally under the heading of stimulus generalization. Brush 
et al. (20) trained pigeons in the Skinner box to peck at a lighted circle 1.4 cm. 
in diameter. The birds were then tested on circles ranging from 2.6 to 0.2 cm. 
in diameter. A bell-shaped generalization curve (with the peak at the train- 
ing size) was obtained. There was somewhat more generalization to the larger 
sizes. Kling (88) ran rats in a Grice alley to white discs. Each group was 
trained on successive trials to two different areas (both positive). Subgroups 
were extinguished on a single large area or on a single small area. The to-be- 
expected generalizations were found. Rosenbaum (138) studied generaliza- 
tion along the dimension of height under different strengths of an irrelevant 
drive (namely, anxiety). Training human subjects to make a voluntary move- 
ment of a bar to a visual rectangle, he induced different degrees of ‘‘anxiety”’ 
by giving strong shock, weak shock, and buzzer to different groups, if the 
responses were not quick enough. He tested on rectangles of other heights and 
the to-be-expected generalizations for response frequencies and response 
amplitudes were obtained. Strong anxiety (strong shock) produced the most 
generalization. Different results were however, obtained for latencies. The 
more different the test rectangle, the shorter the latency. This last finding 
is puzzling. May it be that responses to the remote test stimuli were made 
only when the subjects did not take time to make the required discrimina- 
tion? Kimble & Kendall (86) studied, with rats, the effects of changes in S; 
in an extinction setting. They extinguished a ‘“‘wheel-turning” response which 
turned off shock and which had been conditioned to a strong light as the sign 
for coming shock. They found that, if they began the extinction with a weak 
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light, increasing it by gradual steps, none of which, when introduced, pro- 
duced much avoidance, extinction was more rapid than with the use of the 
original stimulus at full strength. This was conceived as a validation of 
Guthrie’s (62) ‘‘toleration method”’ for extinction. 

Generalization wlh respect to M.—There have been almost no experiments 
in this area. Studies falling here would be ones testing new maze routes (short 
cuts, Umwege), or new time-intervals, or the like, to discover whether and 
how the disposition to respond supported by the original M will generalize 
to a new M or new M’s. One recent study which might, however, very tenta- 
tively be classed here would be that of Notterman, Schoenfeld & Bersh (124). 
They extinguished a conditioned heart-rate response in human subjects by 
three methods: Group I had no instruction; Group II the instruction ‘“‘now 
no shock will follow’; Group III the instruction ‘‘tap electric key and no 
shock will follow.”” Group III extinguished most readily, Group II next and 
Group I not at all during the 11 extinction trials. The provision, by means of 
the different instructions for the different groups, of different new M’s during 
the waiting period caused different degrees of generalization. 

Generalization with respect to S2*.—There seem to have been no studies 
during the year which fall under this heading. Experiments concerning this 
problem would be ones which after acquisition varied the nature or amount 
of S;* and related the subsequent dispositions to respond (performance) to 
such variations. Such data could also be used to gauge the relative incentive 
values of different S,*’s. The extinction situation represents the extreme case 
in which the new S,* has no incentive value and in this case the generalized 
readiness to respond is always lost in time. 


PERFORMANCE IN THE PRESENCE OF ALTERNATIVE EXPECTANCIES 


Several expectancies can be established in the same situational setting. 
So far we have discussed the relation of performance to expectancy as gov- 
erned by drive and conditions of generalization. Several experiments suggest, 
however, that where alternate expectancies are built up the correlated dis- 
positions to respond are affected by additional determinants. 

Fixation.—We conceive fixation as the situation in which performance 
according to an early acquired expectancy interferes with performance ac- 
cording to a subsequently acquired expectancy. Wilcoxon (181) used special 
reinforcement schedules in the Maier “frustration” setup (111) to induce a 
strong expectancy highly effective in performance. His findings suggest that 
fixation is to be explained not as “frustration-instigated” behavior but as 
due to the dominance of the response disposition correlated with the original 
expectancy. It would appear that such factors as the special reward sched- 
ules and primacy of acquisition lead to the development of strong and re- 
sistant response dispositions which later interfere with the appearance of the 
new more appropriate response disposition, even though there was evidence 
in terms of “lookings” and “hesitations” that the corresponding new es- 
pectancies had actually been acquired. 
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Regression.—Aebli (4) trained rats to go to their preferred turn or to their 
nonpreferred side. He then shocked them just prior to choice. Those trained 
on the preferred side continued to choose that side; those trained on the op- 
posite side “regressed” to the preferred side. He concludes that regression is 
to an unlearned preference. 

Alternation.—Where two different expectancies for which the S,*’s have 
equal incentive value are built up, as, for example, in a T-maze where both 
arms lead to food, animals show a tendency to alternate between sides. The 
following investigations are concerned with discovery of the conditions which 
favor or reduce such alternations. Riley & Shapiro (132) tested in a T-maze 
the effect of light versus heavy weights on doors and also the effect of massed 
versus distributed trials. Massed trials clearly produced more alternation. 
Heavy weights gave only slight evidence of producing more alternation. 
Rothkoff & Zeaman (139) tested amount of alternation ina simple T, as af- 
fected by preceding forced trials to one side, by changes of intramaze cues 
and by changes of extramaze cues. They found a tendency to go to a new 
“‘place’’ as well as to make a new ‘‘turn.” They say “‘. . . the data did not do 
violence to a composite ‘tired stimuli and tired response’ type of theory.”’ 
Montgomery (118), continuing previous studies, used the T-maze with op- 
posite starting stems, and found that the rats were alternating with respect 
to goal places but not with respect to turns. Goodman, Moyer & Bunch (54) 
measured alternations in a simple T-maze filled with water in a nonhungry 
group of rats, a group subjected to shock, and a group subjected to airblast. 
No differences in tendencies to alternate appeared. We conclude that alter- 
nation is an expression of the alternate ‘‘fatiguing,’’ not of muscles (Jz) but 
of response dispositions for, say, going to light versus going to dark or for 
making right-hand turns versus making left-hand turns. 


POINTER-READING RESPONSES 


Percentage correct responses, running times, rates of lever pressing, ampli- 
tudes, latencies, trials to extinction, etc., have all been used as pR’s? and 
have been assumed to be highly correlated. But many experiments make this 
assumption of high intercorrelations doubtful, and this lack of consistency 
among measures continues to pose a serious problem for any system (ours 
included) which attempts to use them interchangeably as dependent varia- 
bles. Two recent experiments illustrate this lack of consistency among 
measures. Hall & Kobrick (63) measured the strength of a runway response 
in terms of latency, running time, and resistance to extinction. Latency and 
running were significantly but not highly correlated, but neither of them was 
correlated with resistance to extinction. Logan (105) reanalyzed the data of 
his experiment mentioned above (104) in which he used a double-bar Skinner 
box. The two bars had different delays of reward (1 sec. and 5 sec.). He had 
three measures: preferential latency for the short-delay bar, per cent of 
choice of the short-delay bar, and the difference between the response speeds 
to the two bars presented singly. He found good correlation between differ- 
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ential response speeds and preferential latency but no significant relationship 
between either preferential latency or differential response speeds and per cent 
choice of short-delay bar. 


VERBAL LEARNING 


The theoretical framework for most studies of verbal learning has re- 
mained associationistic. The conditions governing the formation of associa- 
tions among the members of a verbal series continue to claim experimental 
attention. From the general point of view represented in this review, such 
studies serve to specify the conditions under which exposure to conjunctions 
of verbal stimuli serves to establish expectancies concerning such conjunc- 
tions. 

Meaning and familiarity—The related variables of meaningfulness and 
familiarity are stressed in several publications. Noble (121) has presented a 
theoretical analysis of verbal meaning in S-R terms and devised an index of 
meaning based on the number of different associations elicited by a given 
word. Using this index, Noble (122) then investigated the role of stimulus 
meaning in serial verbal learning. A number of performance criteria (pointer- 
reading responses) showed a direct relationship between degree of stimulus 
meaning and learning proficiency. The finding of Kurtz & Hovland (92) that 
verbalization during observation of stimulus objects increases accuracy of 
recognition and recall may be mentioned in this connection. The verbaliza- 
tion may be assumed to increase the richness of the associative context 
aroused by the stimulus objects. 

Variations in meaning and familiarity produce parallel effects. Hovland 
& Kurtz (73) report that familiarization with nonsense syllables prior to 
learning speeds up acquisition. Such familiarization reduces the number of 
trials to mastery by a more or less constant amount for lists of different 
lengths. The results are in conformity with Robinson’s Law of Acquaintance. 
In terms of information theory, familiarization reduces the amount of in- 
formation conveyed by each syllable in the list. 

Information theory is also applied by Aborn & Rubenstein (2), who in- 
vestigated the influence of previously learned rules of organization on the 
immediate recall of nonsense syllables. The rules were such as to produce 
systematic variation in the amount of information per syllable. The lower 
the rate of information the more syllables were recalled. 

The effects of previously learned associations on recall were demon- 
strated by Jenkins & Russell (78). Stimulus and response terms from the 
Kent-Rosanoff list were scattered at random through a list of words. Recall 
tests showed a strong tendency for the Kent-Rosanoff pairs to be recalled 
together. In the recall, the S-R sequence of the Kent-Rosenoff test was also 
maintained. 

Conditions of practice and testing—Under this heading, we shall make 
brief reference to a somewhat heterogeneous group of experiments which have 
varied conventional conditions of practice or testing or both in investigations 
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of serial association. Meyer & Miles (114) investigated changes in the process 
of rote learning which result from continued practice (20 successive lists). 
They were unable to find systematic changes in the mathematical properties 
of learning curves at different stages of practice, and there is scant evidence 
for qualitative changes in the learning process. 

An interesting variation in conventional procedures of serial learning is 
introduced by Deese & Kresse (33) in their study of the types of errors which 
account for the classical bow-shaped serial position curve. To gauge fully the 
response dispositions at different points in the series, subjects were given 
unlimited amounts of time for anticipation of each syllable. Two types of 
errors summate to form the serial position curve: a distribution of intralist 
intrusions symmetrical around the middle of the list, and failure-of-response 
errors which rise to an asymptote in the second half of the list. 

Turning to conditions of testing, we note the failure of Rockway & 
Duncan (136) to find any evidence for pre-recall warming up in retention 
tests for serially learned verbal materials. They made every effort to dupli- 
cate the procedures of earlier investigators but could not reproduce the re- 
sults. Another ‘‘now-you-see-it-now-you-don’t” phenomenon! Also on the 
negative side are the results of Grings (60) who attempted an experimental 
test of the “extinction” theory of verbal forgetting proposed earlier by 
Underwood (169). Varying amounts of distraction introduced during the 
retention test failed to produce ‘‘disinhibition,” i.e., did not lead to recovery 
of presumably extinguished verbal responses. 

Distribution of practice and reminiscence-—The phenomena of distribu- 
tion and reminiscence appear in both motor and verbal learning. There are 
important differences between the two areas, however, as regards both ex- 
perimental facts and theoretical interpretations: (a) the range of conditions 
over which distribution makes a significant difference appears to be much 
more circumscribed in verbal than in motor learning; (b) the theoretical 
construct of reactive inhibition which has carried the main burden of theo- 
retical interpretation of distribution effects in motor learning can be applied 
to verbal learning phenomena only with considerable difficulty. Over the 
last two years or so, Underwood has been conducting a systematic investiga- 
tion of the effects of distribution in verbal learning. Several of these studies 
(170 to 174) appeared during the current year. His results emphasize the 
dependence of distribution effects on very specific experimental conditions 
and the lack of a general theory under which the experimental findings can 
be subsumed. 

Among Underwood’s findings we may briefly mention the following: (a) 
the activity used to fill the rest intervals may significantly influence the 
effects of distribution; thus the conventional activity of color naming induces 
a responding set which raises the frequency of errors as well as the frequency 
of correct responses (170); (6) whereas distribution has a beneficial effect on 
the learning of serial nonsense lists, no such influence is found with paired 
nonsense syllables (171, 172); (c) similarly, with meaningful materials (ad- 
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jectives) serial learning is improved by distribution whereas paired associ- 
ate learning is not (173, 174). It is important to note that careful analysis 
of the wide variety of learning and retention data obtained by Underwood 
fails to lend consistent support to any one theory of distribution, such as 
the reactive-inhibition theory or the differential-forgetting theory. Gibson’s 
generalization hypothesis (51) also fails to handle some of the major findings 
of these studies. The theoretical analysis of distribution of practice in verbal 
learning is still a wide open problem, as was also found to be true for classical 
and instrumental conditioning. 

The picture presented by recent studies of reminiscence in verbal learning 
is equally inconclusive. Using serial lists of nonsense syllables Archer (7) 
failed to find any evidence for reminiscence; variations in the association 
value of the syllables, the length of the rest interval, and in the rest-interval 
activity did not yield any reminiscence effects. The results flatly contradict 
earlier studies. Riley (131), on the other hand, obtained evidence for reminis- 
cence in paired-associate learning with nonsense syllables when competing 
responses are attached to the stimulus terms. The results are at variance 
with a differential-forgetting theory and are consistent with a work-decre- 
ment hypothesis, although the exact nature and conditions of such decre- 
ments in verbal learning remain uncertain. In accordance with earlier studies, 
no reminiscence effects were found by Noble (122) in the serial learning of 
meaningful words. The description of reminiscence as a “‘now-you-see-it- 
now-you-don’t” phenomenon (26) appears to be as apt as ever. 

The law of effect in verbal learning—Much of Thorndike’s experimental 
evidence for the law of effect was obtained in verbal learning situations. Im- 
plications of the reinforcement hypothesis continue to be tested in such 
situations. Thus the problem of “learning without awareness” has been 
reopened. Thorndike had argued that such learning would be evidence for 
the automatic effects of reinforcement. Evidence for discrimination learning 
without awareness of the principle of solution is reported by Walk (176). 
Postman & Jarrett (129) found only scant evidence for learning without 
awareness in the acquisition of an associative response tendency. Their 
results do not, however, support invocation of a principle of insight. Skill in 
the application of a verbalized principle showed a gradual increase. 

The spread of effect, originally offered as critical evidence for the law of 
effect, continues to be reanalyzed. Sheffield & Jenkins (147) offer new evi- 
dence in support of their argument that the guessing-sequence hypothesis 
is adequate to account for the spread phenomenon. In the absence of reward, 
duplication of other features of Thorndike’s situation (e.g., fixed order of 
stimuli, set for recall of word-number connections) yields levels of response 
repetition equal to, or exceeding those obtained by Thorndike. 

The role of anxiety in serial rote learning is analyzed by Montague (117) 
from the point of view of reinforcement theory (cf. similar studies by Spence 
and associates (40, 157). Anxiety, as a drive, is assumed to multiply with 
habit strength to determine excitatory potential of correct as well as incor- 
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rect responses. Anxiety should aid performance if correct response tendencies 
predominate and should hinder performance in the presence of incorrect 
competing tendencies (which are enhanced by the drive along with the cor- 
rect ones). This implication was verified in a design employing nonsense- 
syllable lists of varying difficulty. Anxious subjects performed better on the 
easy lists and were surpassed by the nonanxious subjects on the difficult lists 
involving many incorrect response tendencies. 

Failure and related motivational factors—Verbal materials have frequently 
been used in the investigation of motivational factors in learning and reten- 
tion, particularly to test the concept of repression or hypotheses stemming 
from it. Experimentally induced failure is one of the techniques used to 
create motivational conditions appropriate to the testing of such hypotheses. 
Several studies of this type appeared during this year. Eriksen (39) compared 
memory for completed and incompleted tasks by ego-involved and control 
subjects. Ego-involved subjects (a) recall relatively more completed tasks 
and (b) show greater variability than do control subjects. Eriksen interprets 
his data as demonstrating the importance of ego-defense in memory. Aborn 
(1) compared the relative effects of experimentally induced failure on ma- 
terials learned intentionally and incidentally. Ego-threat had significant 
effects on memory for the incidentally learned materials only. In the case 
of incidental memory, such factors as rehearsal and overlearning presumably 
do not counterbalance the motivational effects. Murdock (120) investigated 
the effects of both failure and retroactive inhibition on mediated generaliza- 
tion. Both types of treatment served to reduce the amount of mediated 
generalization: failure by leading to the “‘suppression,’’ and retroactive in- 
hibition by producing extinction, of the mediating verbal responses. 

An implication of repression theory was tested by Laffal (93) who com- 
pared words which produce association disturbances and words which do 
not with respect to speed of learning and retention. He found (a) slower 
acquisition of the disturbing words, and (6) no differences in retention for 
the two types of words. These contradictory findings lend only indifferent 
support to a repression hypothesis. 

We feel that the effects of failure, stress, and related conditions on learn- 
ing and retention can for the present be adequately and parsimoniously 
handled by the more mundane concepts of current learning theories. The 
invocation of ego defense or repression seems to put too heavy a conceptual 
burden on rather slender empirical facts. 

Social attitudes—The danger of erecting precarious theoretical super- 
structures on limited and uncertain data seems to us illustrated in a study 
of Rokeach (137) concerned with attitude as a determinant of distortion in 
recall. Low- and high-prejudiced subjects were shown photographs of whites 
and Negroes with fictitious names attached to them. In a recall test, the 
high-prejudiced group recalled certain names (Gray, Greene, Lynch) more 
often than the low-prejudiced group to both Negro and white faces, whereas 
the low-prejudiced group was partial to the name Jones. To explain these 
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rather puzzling results, the author invokes the following concepts: Freud’s 
theory of slips, a continuum of cognitive reorganization ranging from assimi- 
lation to various degrees of compromise and contrast, tolerance af ambiguity, 
tolerance of incongruity, closure! The ratio of concepts to data appears 
somewhat high. 

Doob (34) investigated the effects of attitude on incidental memory for 
controversial items and found quantity of recall to be associated with in- 
tensity of attitude. In the same investigation, the effects of serial position 
on items evoking different attitudes were studied. In the case of items evok- 
ing strong drives, primacy increases amount of recall; in the case of items 
evoking weak drives, recency is more effective, probably by reducing the 
amount of retroactive inhibition to which the items are subjected. The 
picture presented is highly complex but we note with interest the attempt 
to apply systematic learning principles to memory data obtained under 
naturalistic conditions. 

Perlmutter & de Montmollin (128) compared the performance of groups 
and individuals in a rote-learning task. Groups were found to be superior to 
average individual learners in total scores, but the best individual performers 
surpassed the group. The learning performance of individuals was improved 
by prior participation in group work. In a further study, Perlmutter (127), 
using Bartlett’s method of repeated reproduction, compared the products of 
individual and group memory. He concludes that some of the group products 
are not derivable from the individuals’ products. The reviewers express their 
bewilderment at the implied resurrection of the unlamented late group 
mind! 


MOTOR LEARNING 


For an expectancy theory, specific motor acts serve as ‘“‘pointer readings” 
for inferring the organism’s cognitive dispositions. Variations in motor 
behavior would, therefore, be treated primarily under the general heading of 
performance. However, expectancy theory has not developed a systematic 
description of motor learning. For an S-R reinforcement theory, on the other 
hand, motor learning provides important opportunities for testing hypoth- 
eses about both learning and performance. Much of the current experi- 
mental work in motor learning centers around the implications of Hull’s 
concepts of reactive inhibition (Jr) and conditioned inhibition (sZr). The 
specific experimental designs are concerned primarily with the effects of post- 
practice rest intervals on the acquisition and performance of motor responses. 
The inhibition built up during practice is assumed to dissipate during the 
rest. Much effort is now being devoted to obtaining as precise information 
as possible concerning the conditions under which inhibition grows and 
dissipates and the quantitative laws of these phenomena. Without reviewing 
these studies in detail, we shall consider some of the main experimental 
problems and conclusions. 

Distribution of practice—Several studies have compared acquisition and 
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performance of motor skills under distributed and massed practice. In 
general, spacing has a beneficial effect on acquisition and performance. The 
effects appear, however, to be transient. Thus Adams (3) reports that both 
his distributed and massed group were approaching the same asymptote of 
performance on the pursuit rotor. Using three different psychomotor tests, 
Reynolds & Bilodeau (130) found that distributed practice during acquisi- 
tion resulted in superior performance on all tests. On tests of retention 10 
min. and 24 hr. later the two groups become increasingly alike. Bilodeau (14) 
observed the effects of massing and distribution over an extended period of 
practice and found that performance depended more on the number of days 
of prior practice than on conditions of distribution. Kimble & Shatel (87) 
present data supporting Kimble’s previous analysis (84) of inhibitory 
processes in motor learning. According to this view, reactive inhibition de- 
creases with continuing practice since with declining motivation the subject 
will allow less and less Ig to accumulate before he rests. Conditioned inhibi- 
tion, on the other hand, is assumed to increase with practice in a negatively 
accelerated fashion. A review of the evidence supporting Kimble’s theory 
is presented by Wasserman (177). 

Reminiscence-—The reminiscence experiment is, of course, a special case 
of distribution of practice. The theoretical analysis of reminiscence effects 
in motor learning follows the same general line as the analysis of distribution 
effects: virtually all the investigators take their lead from the concept of 
reactive inhibition. A number of parametric studies has appeared, relating 
postrest gain to such variables as amount of prerest practice, duration of the 
rest period, and effortfulness of the task (e.g., 13, 36, 123). It would be dif- 
ficult to summarize the experimental findings in terms of a few generaliza- 
tions, beyond the fact that reminiscence does occur following massed prac- 
tice. The reviewers are impressed with the dependence of the functional 
relationships on the specific experimental conditions. 

Important questions concerning the interpretation of reactive inhibition 
are raised by a series of studies demonstrating reminiscence in bilateral 
transfer. Irion & Gustafson (77) found significant postrest gains in pursuit- 
rotor performance even though subjects shifted from right to left hand fol- 
lowing the rest intervals. Similar results are reported by Kimble (85) whose 
subjects shifted from nonpreferred to preferred hand following the rest. Find- 
ings such as these throw doubt on a conception of reactive inhibition as 
localized in specific effector organs. Stimulated by this problem, Grice & 
Reynolds (58) compared the effects of varying amounts of rest on ipsilateral 
and bilateral transfer reminiscence. For both conditions, postrest gains 
were an increasing function of the amount of rest. Although the ipsilateral 
group showed larger gains than the bilateral group the data suggests that the 
general shape of the function is the same. Grice & Reynolds ascribe the post- 
rest gains in bilateral transfer to ‘‘temporary inhibitory factors either asso- 
ciated with generalized postural adjustment or of a perceptual or central 
character.’ Such an interpretation seems to broaden the connotations of 
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reactive inhibition very considerably. This general conclusion is in harmony 
with that reached above in connection with the experiments on alternation 
behavior in rats. 

Rest intervals usually result in warm-up decrements which may be at- 
tributable to a loss of set or postural adjustment. Hence postrest perform- 
ance may be improved by warming-up activities during the rest interval. 
Silver (151) has systematically explored the effects of varying amounts and 
distribution of warm-up activity during a rest interval. Warm-up activity 
clearly improved postrest performance provided the amount of such activity 
did not itself produce too high a degree of inhibition. Distributed warm-up 
activity is, therefore, more beneficial than massed warm-up activity. 

Transfer—Motor learning is convenient for the study of transfer proc- 
esses because specific stimulus and response elements can often be identified 
precisely. Duncan (35) investigated transfer between two motor tasks as 
determined by two variables, degree of learning of first task, and intertask 
similarity. Degree of positive transfer was a direct function of both these 
variables. There was no evidence for interference between tasks. Both re- 
sponse generalization and “learning how to learn’”’ may account for the uni- 
formly positive transfer effects. Evidence for negative transfer effects 
appears, however, in studies of retroactive inhibition in motor learning. 
Lewis, Smith, & McAllister (102) studied retroactive inhibition in perform- 
ance on a two-hand coordinator when original and interpolated tasks re- 
quired a reversal of movements. They found amount of retroactive inhibition 
to vary with the degree of interpolated learning but, contrary to their ex- 
pectations, differential amounts of retroactive facilitation were not devel- 
oped. Using the same apparatus, McAllister (107) studied the effect of 
different degrees of original learning on retroactive interference. Interference 
increases as a function of original learning level, reaches a maximum and 
then appears to decline with overlearning of the original task (this finding 
is similar to results obtained with verbal materials). Relearning data showed 
the interference effects to be relatively enduring. 
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The physiology of vision has just been reviewed by Chang (48). Vernon's 
book on visual perception (229) is almost up to date, yet not enough so as 
to include the newer titles from her contribution to Volume 4 of this series. 
Her approach to perception while in sympathy with phenomenology is dif- 
ferent from that of Gibson, emphasizes individual differences, and is well 
balanced on the nativism-empiricism problems. The past year has brought 
publication of a number of specialized books on applied areas in vision, 
Volume 2 in the series by Linksz (143), Geldard’s general text (91) on The 
Human Senses, and several important new translations. Foremost among the 
latter is that by Boeder of Tschermak-Seysenegg’s Introduction to Physio- 
logical Optics (227). Now for the first time we can read in English a full dis- 
cussion of the physiological psychology of visual action. 

Middleton (153) represents the renewed enthusiasm for giving considera- 
tion to the human eye and its addenda for obtaining, storing, and evaluating 
information. Nonvisual aids such as radar have been oversold to the extent 
that we, the people, ‘are likely to under-rate the importance of actual see- 
ing,’ Middleton says. His bibliography includes many hard-to-find technical 
publications on outdoor seeing. Studies reviewed in the later sections of the 
present chapter show that increasing attention is being given the problem of 
how and when man makes an adequate evaluation of his visual environment. 
But first, what is new in our understanding of visual capacities and their de- 
velopment? 


Basic PSYCHOPHYSICAL FUNCTIONS 


Methods.—Verplanck, Collier & Cotton (230) question the general as- 
sumption that each trial in a psychophysical experiment on the visual 
threshold is an independent event. Using procedures modeled in several 
respects after ones employed by Hartline or by Hecht, they find that if S? 
reports ‘‘yes’’ after one flash he is more likely to do the same on the next 
trial, or conversely. Statistically significant dependence was found with as 
many as 10 responses intervening between the correlated responses. Theories 


1 The survey of the literature reported here covered the 15 months ending May 1, 
1953, The survey was done and the present chapter prepared during the author’s ap- 
pointment as Visiting Research Professor at the University of Rochester. 

2 The following abbreviations and symbols are used in this chapter: S (subject); 
cff (critical flicker frequency); ERG (electroretinogram); C.I.E. (Commission Inter- 
nationale d’Eclairage), also known in the United States as I.C.I. (International 
Commission on I!lumination); j.n.d. (just noticeable difference); RDP (receptor type 
distribution pattern). 
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of the threshold which have assumed independence may need to be re-exam- 
ined. Equally important is the question of what produces the interdepend- 
ence of contiguous groups of trials. Blackwell (22) is on the trail of this prob- 
lem. He suggests a number of procedures for keeping the subject at the peak 
of performance. Among these are: fairly extensive experience for the subject, 
forced choice, training on the procedure with knowledge of results, and a 
minimum of variation in the luminances used within a block of trials. 

Where verbal instruction and report cannot be used, as in work with 
young children or animals, the problem of obtaining peak performance is 
much increased. Fields (76) reports preliminary findings with a procedure 
that results in considerably better visual acuity scores for the white rat than 
have ever been obtained previously. Misumi (155) criticizes some earlier 
methods and offers data on the development of size constancy in human in- 
fants based on a preference-response for the larger of two objects. 

Absolute thresholds —Two independent studies deal with the distribution 
of scotopic sensitivity along various meridians of the visual field. Lange 
(137) measured along vertical, horizontal, and diagonal meridians to 30° from 
the fovea. He reports that sensitivity declines with age over the entire retina. 
His results agree with those of Riopelle & Bevan (191) in finding the fovea to 
have the highest threshold of any region (except the blind spot) out to 30°. 
However, the latter investigators mapped to 56° and along twice as many 
meridians and found for their eight Ss a region beyond 45° below the fovea 
that was even less sensitive than the fovea. Regions of maximum sensitivity 
were found on the horizontal meridian between 15° and 30° from the fovea, 
the temporal region being slightly more sensitive than the nasal. 

Davy (65) describes experiments on the J X¢ relationship for multiple 
flashes of light in the peripheral retina and finds that threshold energy is the 
same for rapidly successive flashes as for single flashes within a critical inter- 
val. Beyond 0.4 to 0.5 sec. of temporal separation, threshold energy is the 
same for each of two flashes as it is for one flash. At this point S first reports 
seeing two flashes consistently. Bouman (27) measured thresholds for line- 
shaped targets, 2’ in width and 2’ to 256’ long. Results agree with predic- 
tions from quanta theory. It is again concluded that two quanta must be 
absorbed within about 0.1 sec. and within a given area (Ricco’s region). For 
green flashes Ricco’s region subtends visual angles up to 32’, which is some- 
what larger than for circular flashes. For red flashes the limit is about 8’, as 
compared with 2 to 4’ in the case of circular test flashes. Intensity, size, and 
distance functions were measured by Kaneko & Obonai (126) in a study of 
areal interaction. With low intensities (up to eight times threshold value) 
the inducing stimulus facilitated perception of the test stimulus. Beyond 
this an inhibitory effect began. This was a negatively accelerated increasing 
function of the intensity and the area of the inducing stimulus. Walsh (237) 
could find no dependence of visual thresholds on the amplitude or the phase 
of the alpha rhythm at the moment of stimulation, thus failing to find sup- 
port for the interesting hypothesis that the alpha rhythm “‘scans”’ the striate 
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cortex. Reaction times also were not related to phase or amplitude of alpha 
at the moment of stimulation. 

Conditions of preadaptation were varied systematically in studies of the 
subsequent course of dark-adaptation by Mote and co-workers. Varying 
intensity and duration of pre-exposure through the range for which I Xt=C, 
they (159) find this relationship holds up to a value of 3,320 mL.-sec. Be- 
yond this, intensity becomes a more potent factor than duration. Their other 
studies (1, 157, 158) are concerned with the effects of intermittent pre- 
exposure. For low values of J X# neither rate of intermittence nor intensity 
variation disrupted the Bunsen-Roscoe relation. For larger values the hy- 
pothesis of Wald and Clark is found generally acceptable: Slower recovery is 
an expression of the regeneration of rhodopsin from the vitamin A stage, 
which becomes predominant over the regeneration from retinene. To obtain 
quantitative data for an analysis of various possible nervous and photo- 
chemical mechanisms in dark adaptation, Bouman (25) presented red or 
green test flashes of various sizes during 0.5 sec. interruptions of either red or 
green adapting fields. Ricco’s region, where integration of energy occurs, is 
found to diminish with increases in preadaptation level, but only for green 
light. ‘‘For red flashes even for the dark adapted eye Ricco’s region hardly 
exceeds the size of one receptor’’ (25, p. 950). This is a remarkable result for 
test patches 7° toward the periphery of the visual field. 

Wald (232) fails to find that pre-exposure to ultraviolet light results in a 
prolongation of dark adaptation time. He does confirm the fact that ultra- 
violet is an adequate stimulus for the aphakic eye. Other work on spectral 
sensitivity is reviewed below under color discrimination. 

Differential brightness thresholds—Holding adaptation levels constant at 
various points in the luminance scale, Kern (128) explored brightness dis- 
crimination functions. Maximum sensitivity, B/AB, for each function occurs 
at the respective light intensity to which the eye is adapted. An absolute 
maximum is found at one Apostilb (107 mL.). At low background bright- 
nesses the contrast threshold data are considered to be well described by the 
two-quanta theory (26), and independence of the adapting processes in each 
eye is found to be complete when variation of pupil size is eliminated. Using 
rod vision 7° nasal from the fovea, Bouman & Blokhuis (28), studied the 
visibility of black against an illuminated background. They conclude that 
response is possible if two quanta are intercepted from the background 
stimulation at the retina. A 60 per cent chance of seeing criterion was again 
employed. Difference thresholds for two luminous points as a function of the 
separation between them were studied by Flamant (78). A separation of 10’ 
to 15’ was found to be optimum. With a constant separation of 10’, sensitiv- 
ity was measured for various positions on the retina between the fovea and 
2° nasally and temporally. Flamant found maximum sensitivity at the center 
of the fovea for red and green monochromatic light. A range between 10’ 
and 40’ from the center proved most sensitive for blue. 

In accordance with his alternation of response theory, Bartley (14) finds 
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brightness enhancement at certain rates of intermittency to be susceptible 
to the effects of fluctuating or steady stray light. Fry & Alpern (87) report 
data confirming the stray light hypothesis and denying interaction mediated 
by retinal mechanisms as the explanation for effects of peripheral glare 
sources upon apparent brightness. Reduced receptor sensitivity produced by 
whole-body x-irradiation (369 to 469 r’s) is held responsible (88) for changes 
in the performance of rats on brightness discrimination problems. 

Both Brindley (33) and Rouse (195) report studies of the Bunsen-Roscoe 
Law at supra-threshold levels. Brindley used white light and a large range 
of times and intensities. For flashes giving a sensation of equal brightness 
the product of time and intensity was found to remain equal for durations 
from 1.54X10-* to 4.11X10~" sec., and for intensities up to about 3108 
photons (trolands). Within this range of duration flashes that appeared equal 
at the fovea also appeared equal in the light-adapted and the dark-adapted 
periphery. Rouse, using central fixation, studied the intensity-time relation- 
ship necessary for differential thresholds of red, green, and blue flashes on a 
surrounding white-lighted field. The critical duration was the same for all 
wave lengths. There is no evidence here for temporal or structural differences 
in the retinal processes associated with color vision. 

Flicker-fusion thresholds were measured by Lloyd (142) for various foveal 
and peripheral areas. For areas with diameters of 1° or 2° thresholds were 
similar for fovea and periphery (20° below fovea). At low intensities the 
periphery was slightly more sensitive, and at medium intensities the fovea 
had higher cff?. The larger field (2°) produced higher cff in both locations. 
For 6° and 14° peripheral areas there were still further increases. Erlick & 
Landis (75) find lower cff with advance in age, which they attribute chiefly 
to a lengthened restoration period during the dark phase. Peckham & Arner 
(173) report that they could predict changes in acuity from changes in cff 
measured morning and evening on college students driving automobiles in 
the Arizona desert. 

Binocular flicker has been studied again in a comprehensive research by 
Baker (11), and again Sherrington’s conclusion that each eye acts independ- 
ently is disproved. Baker also reports variation in cff with size of field within 
the fovea. There is a limited flexibility in the correspondence of position on 
the two retinas which must be stimulated for binocular interaction to occur. 

Acuity.—Using a black Landolt C on an illuminated background, Pirenne 
& Denton (176) have extended Schlaer’s acuity-intensity function down to 
—4 Trolands or Log —6 candles/M?*. Brief exposures were used. For a wide 
range of intensities the quantity of light striking the retinal image of the 
smallest detail detectable is of the order 140 to 900 quanta/sec. The mini- 
mum number of quanta absorbed by the receptors per unit action time (0.1 
sec.) is between 2 and 10, whether absolute threshold or acuity is taken as 
the index of seeing. Barlow (13) reports that, although individual fixation 
pauses show very little movement (root mean square deviations of 0.25’ of 
arc during 0.4 sec.), successive fixation points scatter around a mean position 
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with a root mean square deviation of 5’ of arc. Ditchburn & Ginsborg (69) 
find similar amplitudes of physiological nystagmus. Ratliff (185) shows 
that eye movements and physiological nystagmus hinder rather than aid 
monocular acuity. 

Effects of pupil size on acuity are reported by Leibowitz (138), whose 
data indicate that the natural pupil is normally too dilated for maximum 
acuity in the range of .01 through 1 mL. of field luminance. Groot & Gebhard 
(102) have evaluated, weighted, and combined the available data on pupil 
size as a function of adapting luminance. They present a very useful table on 
retinal illumination for the average eye with known field luminance which 
incorporates corrections for the Stiles-Crawford effect. By pharmacological 
means (adrenalin and cocaine) in normal persons and in persons with Hor- 
ner’s syndrome, Monjé (156) studied the neural mechanisms that control 
accommodation. He concludes that control is exerted by oculomotor and 
sympathetic centers through labile tonus-maintaining mechanisms, that 
there is no relaxed position, and that age, heterophoria, and other factors 
related to individual differences all contribute to accommodation responses. 
Fry (83) discusses some psychological factors in visual acuity and believes 
that improvement in the identification of blurred retinal images can result 
from practice, and, furthermore, that “‘an effort to see’’ may improve acuity 
by producing changes in accommodation, pupil size, or intraocular pressure. 
Bahr (9) agrees that the problem of measuring the depth of focus of the eye 
is extremely difficult. Acuity criteria and type of illumination are highly sig- 
nificant parameters. Bahr reports data on one aphakic S with a constant cor- 
rection of +13 diopters who showed the required acuity through a range of 
2 diopters under daylight fluorescent and 5 diopters with a mercury lamp. 
Wulfeck’s normal subjects showed a similar range of depths of field (242). 
Using an infrared double spot light source, Wulfeck photographed the third 
Purkinje image to find average accommodation times varying from 0.24 to 
0.45 sec., depending upon target size and whether near-far or far-near shifts 
were required. 

Scotopic or night visual acuity has received careful study in the experi- 
ments of Brown et al. (34). Their test fields were gratings subtending 7.3° 
of visual angle. Improving acuity as a function of dark adaptation is seen 
only as a single cone curve when measured with fine gratings, but for coarser 
gratings rod portions of the curves appear after 10 to 15 min. Fry & Alpern 
(86) report an investigation of the effects of flashes of light on scotopic acuity. 
Flashes were varied in brightness and duration, and acuity was measured in 
terms of the width of a black line necessary for visibility against a background 
luminance equated to that of moonlit sky. Pupil size was held constant (2 
mm. artificial pupil). Several basic principles evolved, the most general being 
that the effects of flashes which did not directly strike the part of the retina 
used in viewing the acuity target can be accounted for in terms of stray light. 
There was no spread of adaptation processes from one part of the retina to 
another. To help determine the part played by pupillary responses to light 
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flashes, Fry & Allen (85) report the first studies of many that will have to be 
done before accurate predictions can be made of the course of pupillary con- 
striction and recovery. As a limiting case it is found that 3 sec., 8°, centrally 
fixated flashes of high brightness permit full recovery within the first minute 
following the flash. Koomen, Scolnik & Tousey (131) measured accommoda- 
tion in dim light and in darkness by photographing the Purkinje images of 
two spots of light. Subjects remained focused for far vision or in one case 
accommodated about 1 diopter, which was taken as evidence that night 
myopia is a result of spherical aberration with increase in size of pupil, 
rather than a result of an accommodative response. 

Color discrimination.—Outstanding among a number of new books on 
theoretical and applied aspects of color are those of Judd (124) and of The 
Optical Society of America Committee (170). In the latter, Newhall was pri- 
marily responsible for excellent chapters, with good bibliographies, on psy- 
chological processes in color vision. The Committee now abandons the posi- 
tion taken by the 1922 Committee on Colorimetry, which defined color as a 
sensation. It is clear from the Introduction that this abandonment in favor 
of a psychophysical point of view was not achieved without a struggle. In 
addition to chapters on methods in these new books, there are several papers 
which make important methodological points or describe new apparatus. 
The representativeness of the C.I.E.? Standard Observer is questioned (119, 
123). Nickerson (166) and co-workers present a table of C.I.E. tristimulus 
values with Munsell renotations for all Munsell standard papers made to 
July, 1950. A recheck and extension of data on the spectral energy distribu- 
tion of light sources A, B, and C are provided by Davis et al. (64). Burnham 
(42) describes a new colorimeter, and Schmidt (199, 200) discusses proce- 
dures for testing color blindness. 

In a statistical study of color-matching Brown (35) finds normal distribu- 
tions about a color center, and the magnitudes of one-standard-deviation 
ellipsoids are shown to be smaller than those misrepresented as such in earlier 
work. By contrast with the fineness of hue discriminability, the number of 
absolutely identifiable hues is astonishingly small. Halsey & Chapanis (106), 
in order to minimize somewhat the effects of habitual color names, taught 
subjects identifying numbers for spectral hues. Presenting the hues one at a 
time and using a high criterion of accuracy, they found that there were only 
10 or 12 absolutely identifiable hues through the spectral range for which 
there are more than 150 hues that can be discriminated by direct comparison. 

Several researchers have measured effects of chromatic surroundings, of 
adaptation, or of field size on color perception. Brown (36) finds that 2° 
fields give poorer discrimination and are more affected by surrounding colors 
than are 12° fields. Burnham (41) reports that larger areas of color appear 
brighter and more saturated than small ones of the same physical intensity. 
Hunt (116), using a binocular matching technique, shows that maximum 
saturation is obtained with low levels of adapting intensity and high test- 
color intensity. He reports that the dark-adapted eye sees a pale blue rather 
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than colorless light. Burnham, Evans & Newhall (43) also had their observers 
match fields presented separately to the two eyes. A change from an adapting 
artificial daylight to tungsten light produced a shift in the perceived colors 
toward the blues on the order of 20 j. n. d.2 The converse produced a shift 
toward the yellows. Jameson & Hurvich (123) measured the spectral sen- 
sitivity for a 1° foveal field as it depends on the state of adaptation: dark or 
light neutral, blue, green, yellow, or red. A slight Purkinje shift was found 
for the small central field, which the authors prefer to explain on other than 
a rod-participation basis. A prominent shoulder at 600 to 610 my almost dis- 
appears with red adaptation. The data do not fit the commonly cited tri- 
chromatic response curves, but other curves could presumably be made to 
agree with the results and still remain within the general framework of three- 
color theory. 

Frank defect in color vision and variability in presumed normals are 
active topics of investigation. Wright (241) has given a fascinating account 
of his discovery of about 40 tritanopes through publication of a color chart 
in a picture weekly. One near monochromat was also found. On the basis of 
numbers responding Wright estimates an incidence of tritanopia somewhere 
between 1 person in 13,000 and 1 in 65,000. Genetic transmission seems to 
differ from that for protonopia or deuteranopia, but workup of family lines 
is still in progress. All tritanopes had “‘large field” defects of which they were 
quite aware. According to Halsey & Chapanis (107) protanopes give matches 
with blue-green standards that diverge markedly from Judd’s predictions 
based on Pitt’s data for spectral colors. Matches by deuteranopes and the 
neutral matches by protanopes agree well with Judd’s predictions. Constants 
for isochromaticity lines are given. These authors (49) report that photopic 
thresholds for red light (640 my) in an unselected sample of color-deficient 
individuals vary in a fairly continuous distribution, there being only a slight 
indication of bimodality. Burnham (41) finds marked individual differences 
in components selected on a three-knob colorimeter to match Munsell colors, 
which he believes are attributable to differences in sensitivity. Ishak (119, 
120) also reports variations from individual to individual for tristimulus 
values of the spectrum. He feels that the C.I.E. chromaticity coordinates are 
based on too few observers. Egyptian and British observers are found not to 
differ significantly in their means for the red coordinates, but for the green 
the mean is significantly higher and for the blue lower in the Egyptians (119). 
Differences in densities of ocular pigmentation are ruled out. 

Walls & Mathews (236), from an extensive review of some color vision 
literature and their own data, come out with 61 summarizing conclusions. 
Their highly argumentative monograph is certain to provoke rejoinders. 
They urge adoption of their sensitive methods for the information of great 
theoretical interest that these are producing. They cling tenaciously to a 
three-color theory, reject the notion of macular pigmentation as an explana- 
tion of individual differences, and accept the evidence for nonuniform dis- 
tribution of chromatic receptors in the fovea. Maxwell’s spot is regarded as 
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an entoptic indicator of the distribution of receptors. The primary objective 
of their research is to relate foveal receptor-type distribution patterns 
(RDP’s) to various types of color-blindness. The blue RDP? in protanoids is 
considered evidence that where redness receptors are absent their places in 
the retina are occupied by supernumerary blueness receptors. Deuteranopes 
and achromates do not see an RDP. Based on the different patterns of the 
RDP that normal subjects report, they can be classified into six types. Ten- 
tative quantitative interpretations of the relative numbers of receptor-types 
in different zones of the RDP are offered by Walls & Mathews as a basis for 
discussion and further work. Weale (238) measured both spectral sensitivity 
and wave length discrimination out to 70° from the fovea. Three regions of 
maximum sensitivity do not have clear parallels in fineness of wave length 
discrimination. Weale believes that differences in hue discrimination with 
retinal location will be accounted for in terms of post-receptor mechanisms. 

Three-receptor theory is being challenged. It is probably an oversimplifi- 
cation. Motokawa & Ebe (162) find maxima for spectral sensitivity at seven 
different wave lengths: six appearing at the fovea (415, 465, 515, 575, 610, 
and 650 my) and the seventh at 507 my 20° from the fovea. Motokawa & Ebe 
used alternating current to elicit phosphenes and found threshold-energy 
minima at specific frequencies. Sensitivity to these frequencies could be fur- 
ther enhanced by low intensity monochromatic light. The optimum spectral 
hue for a given AC frequency was identified. The range of AC frequencies 
employed corresponds nicely with that which Lindsley (141) reports he can 
pick up in the optic tract, although the visual cortex fails to follow frequen- 
cies beyond the middle of this range. The optimum sensitizing spectral wave 
lengths for the fovea correspond well, as Motokawa & Ebe point out, with 
the threshold sensitivity maxima found by Thomson (223). Thomson’s three 
red-insensitive observers lacked the maximum at the longest wave length. 
Additional data on the interchangeability of wave length with frequency of 
stimulation at the retina are presented by Motokawa & Ebe (160, 161) and 
by Hess (113). Two flashes of white light can be used to produce the same 
supernormality effect as a colored light if the proper interval between flashes 
is used (160), or intermittent white light will produce “subjective” color 
when applied to one retina, as opposed to the two retinae in alternation (113). 
The methods and abundant research coming from Motokawa's laboratory 
have been reviewed by Gebhard (90), and a stimulator suitable for pursuing 
research with these techniques is described (12). Gebhard (89) failed to find 
more than one minimum in a strength-frequency curve, but he only sampled 
two frequencies above 25 c.p.s. 

From a different approach Tschermak-Seysenegg (227) rejects three- 
component theory as inadequate. Although he thinks in terms of four pri- 
maries for chroma, he posits six visual substances. The complexity of retinal 
photochemistry is becoming increasingly apparent (48). There is evidence 
that absorption maxima vary considerably with pH. Dartnall (62, 63) re- 
ports two new visual pigments, which he identifies by their wave lengths of 
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maximum absorption, a practice which he suggests should become general. 
Collins (59) describes a series of experiments which demonstrate the close 
relation of indicator yellow to rhodopsin. Far from accepting it as an arti- 
fact, he regards indicator yellow as a key substance in visual chemistry. 

The retinal mechanism responding to red light has been found to have 
another index in the X wave of the electroretinogram (ERG). Armington (3) 
and Schubert & Bornschein (201) independently report that in protanopes 
the X wave, which according to Armington is most easily elicited by light 
in the region of 630 my, is not present. Armington reports further that this 
component of the ERG? dark adapts rapidly (within 2 or 3 min.). Armington, 
Johnson & Riggs (4) have analyzed the A wave of the ERG into two com- 
ponents, the second of which, in sequence of appearance, is dependent upon 
dark-adaptation and shows scotopic spectral sensitivity. Temporal factors 
relating the rod and cone systems are studied further by Bouman & Brink 
(29) and by Bartley & Wilkinson (15). At threshold levels the two systems 
show mutual dependence (29). 


DEVELOPMENTAL PROCESSES IN VISION 


Genetics and early maturation.—The development of vision is an exceed- 
ingly intricate process which is beyond the scope of any single discipline. 
Experts in the area are showing an increasing ability to take a multiplicity 
of factors into account. This is clearly revealed, for example, by Sorsby (216, 
p. x) in the Preface to his Genetics in Ophthalmology where he states: ‘‘The 
fatalism with which genetic affections are generally regarded is as unfounded 
in reason as it is unjustified by achievement already reached.”’ The implica- 
tion here that not all the eggs are in one basket is born out in many of the 
new studies of hereditary and environmental factors in visual development. 

Section I in Sorsby’s book is theoretical, covering modes of inheritance, 
human pedigrees, varieties of genetic disease, and prospects of controlling 
genetic disease. Section II treats ocular anomalies per se, and Section IIlisa 
discussion of metabolic and systemic functions and their ocular effects. The 
genetics of color blindness, thoroughly reviewed in Sorsby’s book, is also re- 
lated to specific types of defect by Jaeger (121, 122), by Walls & Mathews 
(236), and by Wright (241). Jaeger questions whether red-green disturbances 
are always sex-linked recessives, reporting the case of a deuteranomalous 
woman who has two sons, one deuteranomolous and the other normal. 

The problems of how and why vision begins to function are being attacked 
anew with some definitive methods. During the first two or three days after 
birth there is no ERG or it amounts to only a slight rise in base line [Zet- 
terstrém (246)]. The B wave increases gradually with age and by three 
months a measurable potential of 0.10—0.20 mv. has developed. The A wave 
develops between the sixth and twelfth months. Does this imply that the 
color receptors follow a similar time course of development? Unfortunately, 
our knowledge of the structural changes in the retina during infancy is too 
meagre to enable any close correlation with the ERG changes at present. In 
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premature infants of the lowest weight group the ERG appears after a con- 
siderably longer time. 


The fact that ERG appears in children born only a few weeks before term, within 
the same time as is required for children born at term, supports the theory that extra- 
uterine factors are of great importance for the appearance of ERG (247, p. 408). 


Brattgard (31) studied the effects of withholding light stimulation in new- 
born rabbits on the chemistry of retinal ganglion cells. The method, devel- 
oped in Hydén’s laboratory, permits the determination of lipids, nucleo- 
proteins, and proteins in single nerve cells, and the finding is that ganglion 
cells in young eyes deprived of light lack the pentose nucleoprotein fraction. 
This is interpreted by Brattgard as indicating that early postnatal absence 
of stimulation leads to incomplete development of the nerve cells and to dis- 
turbances in their metabolism that are not immediately recovered. The work 
has not gone far enough to indicate whether the changes may be irreversible; 
however, if such proves to be the case, it would seem that we have here an 
explanation of some results of prolonged light deprivation reported for chim- 
panzees (186). The work strongly confirms the need for diffuse light stimula- 
tion when the development of pattern vision is studied by a deprivation 
technique, as in several recent studies (186, 188, 204). But is there no ‘‘spon- 
taneous” neural activity in light deprived animals? Brattgard believes that 
the ganglion cells and those in the nuclear layers of the retina are the only 
nerve cells which can be effectively excluded from stimulation experimen- 
tally. Denton & Pirenne (68) agree in that they affirm “biological noise’’ or 
background excitation of the retina to be extremely low. 

Development of acuity—The satisfactory determination of acuity in very 
young children is still a difficult problem. Bouman et al. (30) have suggested 
an “objective” method, calibrated against Snellen opto-types, which may 
prove useful for this purpose. Keiner (127) claims to find an acuity of 5/5 
in many children by the age of three years. Sabatini (196), by contrast, re- 
ports improving acuity through age nine, or the fourth grade level. Both 
investigators urge the importance of psychological factors, and their appar- 
ently divergent results are testimony to the fact that acuity is not a ‘‘pure”’ 
factor and that different procedures will emphasize different components or 
requirements. By retinoscopy Hirsch (114) determined that between the ages 
of five and fourteen years the mean refractive state moves in a minus di- 
rection about 0.5 diopter. The median shows this change to a lesser extent, 
indicating a disproportionate contribution of extreme cases. Fewer than 1 
per cent of the five and six year olds had myopia in excess of 1.00 diopter. 
More than 5 per cent had this extent among the thirteen and fourteen year 
olds. A rather sudden increase in myopia occurs at puberty, a year or two 
earlier in girls than in boys. For changes in senescence Dufour & Cuendet 
(70) attempt to develop a single quantitative index for combined acuity, 
sensitivity, and visual field losses, and Kleemeier (129) shows that reported 
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loss of hue discrimination in older persons is based on methods that have not 
ruled out the factor of acuity. 

Amblyopia ex anopsia is considered to be secondary to poorly developed 
binocular coordination (127, 143, 216). Keiner cites his ERG results as fur- 
ther evidence that these monocular acuity deficiencies are central rather 
than retinal processes. 

Binocular coordination and stereoscopic vision.—A developmental point 
of view on the origins of binocular fusion and stereoscopic vision is steadily 
making inroads on the position that innately determined reflexes (or field 
forces) constitute the sufficient basis for these functions. On occasion the 
newer viewpoint is carried to extremes, particularly when urgent needs for 
the clinical treatment of ‘‘squint’’ arise. Nevertheless, some sound research 
is being stimulated by these needs. Keiner (127) reports observations on 
newborn babies of dissociated wandering eye movements and spontaneous 
monocular reactions, which give way after one to three weeks to transitory 
conjugate fixation movements in response to a moderately strong light stimu- 
lus located a short distance from the eyes. A difference in the reactivity, at a 
slightly later stage, to stimulation of the nasal and temporal halves of the 
retina leads Keiner to conclude that the crossed pathways are more predom- 
inantly connected with the binocular gaze center than are the uncrossed. 
After further study of 984 cases of squint, this author concludes that there is 
almost total dissociation of the two eyes at birth; familial predispositions are 
strong, but there is no such entity as congenital squint. Rather, strabismus 
is a manifestation of a retarded developmental process which normally 
reaches its highest activity at the end of the first six months and is complete 
at about eighteen months. During this crucial period binocular ‘“‘reflexes’’ 
are being conditioned, in the Pavlovian sense, to visual stimuli which orig- 
inally elicited only monocular optomotor responses. Hormonal, refractive, 
early illnesses, and a variety of other factors which may contribute to the 
retardation of this process are suggested. A majority of Keiner’s cases of 
strabismus was apparent before the age of one year, many cases spontane- 
ously recovering later in childhood, but the longer a squint was allowed to 
persist, the more difficult retraining of the optomotor reflexes became. 
Baird (10), Lancaster (135), and Lange (136) come to similar conclusions 
concerning the prior importance of fixational, rotational, and fusional re- 
sponses in the developmental sequence, and the need for exercising these 
functions as opposed to concern over visual acuity. Where binocular anoma- 
lies have developed, operation on the eye muscles is sometimes sufficient to 
permit the attainment of fusion or stereoscopic vision or both, according to 
Riise (189) and Portmann & Dufour (178), but the latter study indicates 
considerably improved prognosis where orthoptic re-education is added to 
the treatment. This is also the conclusion of a recent symposium by Rodman 
et al. (193). 

Sabatini (196) reports that stereoscopic vision continues to improve in 
school children through the fourth grade. Walls (235) makes an impassioned 
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plea for extending even a normal person’s zone of single binocular vision 
(Panum’s areas) by adding to the visual training armamentarium. Just 
what such training would do, if anything, to the still larger zones wherein 
depth is perceived in the presence of double images [Ogle (168)], no one has 
yet been able to say. 

The psychological-physiological basis for the development of anomalous 
correspondence is obscure. Its stability and extent is in dispute, largely be- 
cause the properties of the “false macula” differ considerably among differ- 
ent cases, but also because of the assumptions held by the clinician making 
the study. Colson (60) believes, with Verhoeff, that a squinting eye shows 
false correspondence to the fovea of the eye in central fixation anywhere 
along the line between and including the location of the ‘false macula” 
and the true fovea of the squinting eye. Halldén (105) used an afterimage 
method to measure the angle of anomaly. Five of nine young adults were 
found to have harmonious anomalous correspondence, i.e., the angle of ob- 
jective squint and the angle of anomaly were equal. By introducing prisms 
before one eye while these subjects were fusing, Halldén recorded fusional 
movements which completely compensated for the prisms. Sensory fusion 
was also demonstrated where the movement could not quite overcome the 
full power of the prism. Halldén is willing to entertain a conditioned response 
theory of the development of anomalous correspondence. Kretzschmar (132) 
set out to determine whether there was a redistribution of corresponding 
points over the entire horizontal axis of visual space. Such a reallocation of 
local sign is reported, the amount of displacement closely approximating the 
objective angle of squint. Small angle squint with anomalous correspondence 
is notoriously difficult to re-educate (178). Apparently this condition is a 
fairly satisfactory biological state of affairs. Vukovich & Miiller (231) have 
asked the question: what happens to fusion zones where there is merely a 
demonstrable heterophoria? They find that in exo- and esophoria there are 
opposite shifts in the boundaries of Panum’s areas, in peripheral as well as in 
central vision. An average horopter, however, remains within both ranges 
of single vision and is asymmetrical with respect to each type of deviant fu- 
sional zone. 

All these data raise the question: what happens to accommodation in 
conjunction with atypical eye rotations? Fry (84) reports that, although ac- 
commodation and convergence are closely linked physiologically, some in- 
crease in the flexibility of the relationship can be achieved by training. He 
believes that fusional convergence and accommodative convergence, while 
operating through different mechanisms, may, nevertheless, utilize a com- 
mon center in the midbrain. Lord (144) measured eye rotations with changes 
of accommodation and reports failure to confirm Hering’s law, although a 
change of accommodation was found to be associated with a saccadic excur- 
sion of the eye. In a subject who showed suppression in one eye, no move- 
ments accompanied change in accommodation, and “it thus seems likely 
that binocular visual experience is entirely responsible for eye movements as- 
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sociated with change of accommodation . . .’’ (144, p. 671). Piper (175) does 
not assume such independence. On the contrary, he hooks up accommodation 
with convergence or divergence of the eyes and then places this compound 
mechanism all at the mercy of the autonomic nervous system. The main 
experimental findings are that in esophoria accommodation for near is slow, 
and the converse holds for exophoria. Arguing from the effects of drugs on 
accommodation [Monjé (156)], Piper concludes that if the parasympathetic 
division is poorly controlled (released, as in youth) then adjustment for near 
is accompanied by an overshooting impulse and esophoria results. The 
adult tends toward sympathetic release and outward deviations. No data on 
the relative incidence of these phorias with age are given. It will come as 
something of a shock to old line physiologists to read that: ‘For most ob- 
servers the conscious estimate of the position of an object plays a part at 
least as important as accommodation in determining convergence” [Asher 
(5, p. 675)]. Those who have accepted some form of conditioning theory 
may be able to look with favor upon this statement. The data presented by 
Asher are quantitative and worthy of consideration. 

Binocular equivalence and binocular rivalry.—Binocular equivalence, inso- 
far as it is measured by learned responses to stimuli presented initially to 
only one eye, is far from complete in adult birds [Levine (139)], and even less 
so in young birds deprived of form vision from the time of hatching [Siegel 
(204)]. Riesen, Kurke & Mellinger (188) report that, whereas young, visually 
experienced cats show transfer of monocularly learned habits immediately, 
cats whose prior visual experience through one eye was restricted to diffuse 
light could not perform habits when seeing through that eye, even when 
these habits had been overlearned during use of the other eye. They inter- 
pret these results as indicating that binocular equivalence in the adult mam- 
mal is dependent in part upon early conditioning processes involving se- 
quences of activity in specific neural pathways. They assume that these co- 
ordinating pathways can be initially activated as discrete pathways only 
by patterned visual stimulation. It would be extremely helpful in this con- 
nection to know whether interocular transfer of afterimages of movement 
of the kind described by Walls (234) would occur in the case, say, of a person 
just operated upon for congenital cataract. Ohwaki & Kihara (169) report 
observations of the so-called ‘‘Bocci image’’ (a faint afterimage observed in 
one eye after stimulation of the other) and regard this as having some of the 
characteristics of eidetic images. These authors find variation with age. 

The problem of the sensory integration of impulses from the two eyes has 
one of its counterparts in the process of rivalry. Phylogenetically, monocular 
vision is prior to binocular. Evidence cited above is in agreement with the 
view that this is true also in human ontogenesis. Optic pathways in the mam- 
mal are obviously better arranged for binocular coordination of sensory in- 
put than are those of the bird. Nevertheless, even in adult man vision seems 
to become monocular under certain conditions. Asher has just revived an old 
theory of du Tour, supported it with some new observations on rivalry in 
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stereoscopically presented patterns, and proposed that (6, p. 49), “‘of a pair 
of corresponding points, one always suppresses the other.’’ Binocular depth 
is related to the rivalry in the suppression process. Asher overcomes the 
problem of binocular color mixture by postulating a mosaic effect in which 
a fine grain mental picture results from the close intermingling of dominant 
and suppressed points. Alexander & Bricker (2) report only partial success 
in their effort to influence rate of rivalry by raising or lowering contrast. 
Steinitz (220) questions the doctrine of precise correspondence of points and 
urges adoption of a more plastic conception. His argument becomes circular 
when he says that images fuse when and because they occupy the same posi- 
tion in psycho-optical space. 

Development of pattern recognition and space perception Gibson & Wad- 
dell (99) have demonstrated that depth perception is dependent upon differ- 
entiated retinal images. Gibson & Dibble (98) eliminate the perception of a 
surface as a possible “elementary” impression. Teas & Bitterman regard 
their experiments as evidence that aggregations of afferent visual processes 
function initially as ‘‘loosely organized wholes out of which the perception of 
objects and relations is subsequently differentiated’”’ (222, p. 140). These 
observations place a high premium on the study of area processes in vision. 
Boring (23, 24) and Gibson (94) have argued about the best approach. De- 
velopmental studies are contributing to the knowledge of this subject, and 
other work will be covered below. 

Hebb’s theoretical position has been the inspiration for a number of de- 
velopmental studies in man and animal. Experimental variation of the early 
visual environments of rats has produced consistent evidence for the contri- 
bution of the opportunity for visual exploration to later behavior (20, 80, 
117). A late “free’’ environment is not a substitute for an early one. ‘‘Play- 
things” also help. Siegel’s experiments with the ring dove (205) provide 
further evidence that visual learning is a cumulative process. 

Pattern discrimination, as measured by word recognition, is somewhat 
specific to the area stimulated on the retina if there has been differential 
practice for such areas [Mishkin & Forgays (154), Orbach (171)]. Better rec- 
ognition for words in the right side of the visual field is found only in the 
later grades of elementary school (79). These data are taken as evidence 
against an innate equipotentiality of different parts of the visual field. 

Thresholds for the perception of patterns are shown to vary with recent 
past experience in studies by Atkinson & Ammons (7) and Bevan & Zener 
(19). The apparent size and distance of a triangle was altered by recent ex- 
perience with objects of this shape [Smith (210)]. The apparent size and 
distance of a control figure was not altered. Kume (133) concludes that past 
experience is one of the main factors determining size constancy. In a dis- 
torted-room setting marriage partners were found to be less susceptible to 
size distortion than were strangers [Wittriech (240)]. Harper (108) reports 
that figures which are characteristically red (like a lobster) require more red 
in the background before they become indistinguishable from it than do 
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figures which ordinarily have no characteristic color (like an oval) but which 
are equally red for this particular experiment. 

Several comparative studies on primates deal with phylogenetic aspects of 
pattern vision. Gorillas at a fairly early age perform complex patterned 
string problems at levels superior to those of monkeys [Riesen e¢ al. (187)]. 
Complex visual pattern discrimination in the chimpanzee is demonstrated in 
the work of Hayes & Hayes (111, 112) on picture identification and imitation 
behavior. According to Cole’s experiments (58), form is subordinate to color 
when monkeys can learn visual discrimination habits on the basis of either 
cue. Nevertheless, by eliminating the color cue on critical trials, Cole found 
that form was effective in determining response in favor of this previously 
rewarded subordinate stimulus. Butler (45) interprets his new observations 
to mean that there is a primary motivation toward visual exploration in the 
monkey. 

These recent studies contribute to the understanding of how learning 
supplements in a highly significant way the innate visual adaptive responses 
of the organism. We cannot say that all innately determined responses are 
seen prior to those dependent upon associative processes. Fear of strange 
stimuli, shown to be present in human infancy by Gesell and co-workers and 
by other earlier studies, and now demonstrated for primates (186) and dogs 
(151), certainly requires learning but apparently has a strong innate deter- 
minant. The protective blinking reflex to menace (127, 186) and the visual 
placing response (188) are further examples of behaviors that become highly 
automatic, yet depend on prior environmental conditions suitable for the 
establishment of conditioned responses. Keiner (127, p. 131) conceives of 
vision as resulting from the cooperation of optomotor, associative, and per- 
ceptual components. The interaction of these factors during early develop- 
ment is clearly indicated in an increasing number of experimental studies. 

Agnosia.—Once established, visual recognition behavior may be lost 
again through brain damage, accidentally or experimentally produced. Inter- 
pretations of such losses are entering a new theoretical phase [Bay (16); 
Bender (17); Semmes (202)]. It is proposed that ‘‘associative’’ cortex may 
return impulses to the primary sensory areas and there stabilize or modulate 
the sensory process. Instability of sensory thresholds is found even in 
“pure” agnosias (16), and sometimes the most sensitive test employs simul- 
taneous bilateral stimulation where central interaction accounts for the loss 
of response to the damaged side [Bender (17); Denny-Brown et al. (67)]. 

Parietal and temporal association areas are implicated in form recogni- 
tion [Chow (51, 52); Denny-Brown et al. (67); and Riopelle et al. (190)]. 
Chow finds that the conditions of postoperative visual experience are a 
factor in the “spontaneous”’ recovery of lost habits following removal of 
temporal cortex, and Riopelle and co-workers report that the “learning set’”’ 
task was seriously affected by bilateral (two stage) invasion of the temporal 
lobes, whereas solutions of patterned string problems were not. 

Vision with spatial inversion.—Snyder & Pronko (213) have repeated the 
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Stratton experiment, essentially, with Snyder wearing the inverting lenses 
for a 30-day period. There actually appears to have been more nausea, eye 
fatigue, and headache after removal of the lenses than when they were first 
donned. The authors take the extreme empiricistic position in their interpre- 
tation. 


SPATIAL DISCRIMINATIONS 


Form perception.—Sleight & Duvoisin (209) have compiled an annotated 
bibliography of form perception in which emphasis is placed on the relative 
discriminability of geometric shapes. With sorting time as the criterion, 
Sleight (207) finds that discriminability for some geometric forms is better 
than for others by a factor of ten. The “‘easy’”’ and “hard” figures do not 
bear any clear relation to “simple” figures in the Gestalt sense. Threshold 
studies give different and apparently contradictory results, depending upon 
the method. Naruse (165) used brightness thresholds and found triangles 
and rectangles were more easily seen than circles. Cheatham (50) used 
Pieron’s method for measuring perceptual latency and reports that angles 
were seen later than smooth contours. Bruner et al. (37) find, in line with the 
well-known Carmichael, Hogan and Walter experiment, that ambiguous 
figures given verbal labels in advance of exposure are reproduced more in 
accordance with such a label than when they are not labeled or are given 
another label. The horizontal-vertical illusion in lines [Pollock & Chapanis 
(177)] or in geometric figures [Sleight & Austin (208)] does not hold for all 
subjects. In general, lines tilted 20° to 30° to the left look longer than lines 
in any other orientation. Lichte (140) measured shape constancy for a square 
rotated on its vertical axis until it appeared equal in shape to one of four 
standard rectangles. The Brunswik ratio (index of constancy) decreased as 
angle of rotation increased. Wide individual differences were found. 

Apparent size is determined by retinal angle if distance cues are effec- 
tively eliminated [Hastorf & Way (110)]. Under the conditions of Smith’s 
(212) experiments perceived size may lie outside the range between the limits 
that would be predicted from the “‘law of size-constancy”’ and the “law of 
the visual angle.” Young (244) reports that Emmert’s law does not accu- 
rately predict the size of afterimages over large distances. With high stimulus 
luminances, afterimages were smaller than would be expected. 

The principle of closure in form perception continues to resist precise 
definition. For specific sets of experimental conditions some limiting factors 
are being measured. Depending on the size of gap, and on the time of ex- 
posure, an incomplete circle is changed in tachistoscopic perception toward 
“closure” or its opposite, enlargement of the gap [Tresselt & Simberg (226)]. 
As either supplementary or as alternative to orthodox Gestalt theory, which 
relates closure, proximity, and the like to dynamic processes of the brain 
field, current studies propose and find evidence for ecological or ‘‘expect- 
ancy” factors [Brunswik & Kamiya (39); Postman & Bruner (182)]. Ap- 
parently failing to confirm this experiential interpretation is Michael's study 
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(152), in which Navajo and white subjects did not show significant differ- 
ences in the numbers of ‘‘closure’”’ responses. Turner (228) proposes an aes- 
thetic factor as a supplement to the principle of Prdégnanz. George (92) con- 
firms the Hebb-Foord findings and rejects this principle for delayed repro- 
ductions. The derived principle, flexibility of closure, is found stable enough 
as a measure of individual differences to give good correlations with an- 
alytical ability or deductive thinking [Pemberton (174)]. Ogasawara (167) 
describes a quantitative study of displacement effects: the change in ap- 
parent size of a circle when a concentric circle is added to the figure. 

Figural aftereffects Christman (54), Deatherage & Bitterman (66), and 
Kakizaki (125) report phenomena which can be fitted into the theory of 
satiation, but Azuna (8) and Luchins & Luchins (145) find the need for sup- 
plementary principles to explain certain aftereffects and changes in illusions. 
Kohler, Held & O’Connell (130) discuss some facts of visual perception not 
explained by the neural conduction doctrine. In support of their field theory 
they report extensive records taken chiefly over the striate cortex when ob- 
jects were moved through the visual field. The same electrical polarity rela- 
tions occurred whether the moving objects were bright or dark. Are these 
figure currents related to the phenomenon recently reported by Burns & 
Grafstein (44)? They describe a surface-positive response obtained from 
stimulating an isolated slab of cortex with intact blood supply. The response 
“spreads without attenuation in all directions from the stimulated point.” 
Knife cuts are said to show that the spread depends on dendrites and synap- 
ses lying below 1 mm. in the grey matter. 

An alternative theory which attempts to account for the better known 
effects on perception of previous form stimulation in terms of principles of 
conduction along discrete pathways is offered by Osgood & Heyer (172). 
They point out, after Marshall and Talbot, that by the time retinal events 
produce effects at the cortex, some dispersion in space and time will have 
occurred. The spatial distribution of excitation along contours will have a 
shift in its maximum as the result of previous excitation. Thus, “‘inspection”’ 
contours leave individual neurones in unequal states of recovery according 
to symmetrical distributions, which will then shift the zone of maximum ex- 
citation when a new contour appears in an adjacent region. This theory 
would demand an increase in threshold for luminous points near zones of 
previous stimulation by an inspection figure. Two recent studies [Bevan 
(18); Yokose & Uchiyama (243)] report just such changes, although it is 
odd that both should have been motivated by the conception that they would 
support Kohler’s theory. Both the satiation theory and this neuron recovery 
theory still leave much to be desired in the way of precision before they can 
be put to critical tests. At present, it is difficult to see at just what points 
they may be incompatible, if they really are. There is still the big question 
of what happens beyond the striate cortex. Field theory has nothing to say 
on this problem, and two recent studies demand a central explanation for 
perceptual changes that cannot be referred to past or present retinal events 








74 RIESEN 


or their isomorphic counterparts [Mull, Arp & Carlin (164); Solomon & 
Werner (215)]. Three dimensional phenemona of apparent movement, such 
as those described and correlated with objective conditions by Burke (40), 
Ekman (72), Hall e¢ al. (104), and Smith (211), are also difficult to translate 
into theoretical terms. 

Size and depth perception—The extensive and highly competent dis- 
cussions of space perception in Linksz (143) and Tschermak-Seysenegg (227) 
should be recommended reading for every psychologist. Linksz describes 
many teachable demonstrations and both authors discuss European findings 
that are being “rediscovered” in America, often without benefit of familiar- 
ity with the previous work. Linksz (143, p. 426) describes a convincing self- 
demonstration of convergence as a cue to distance (and size) for relatively 
near objects. Chalmers (47), using a dark tunnel to control secondary cues, 
compares monocular and binocular viewing conditions for size and distance 
judgments. Monocular vision fails beyond 25 feet and binocular vision be- 
gins to fail at 30 to 80 feet. Gumbel (103) reports an advantage for binocular 
viewing in depth judgments even at scotopic levels of illumination. Physio- 
logical mechanisms which mediate the accommodation, convergence, and 
binocular disparity factors in space perception are further discussed by 
Asher (5, 6) and by Ludvigh (146). 

The apparent slant of a plane surface normally depends on a variety of 
cues. Cibis et al. (57), introducing cues, one at a time, find that perspective 
attributable to horizontal boundaries (for a small, rectangular surface ro- 
tated about the vertical axis) proves more effective than vertical stripes or 
irregular markings. Stereoptic cues were almost as effective as the strongest 
perspective cue under their conditions. Gibson and co-workers find that a 
gradient of density of visual detail is one sufficient and highly effective con- 
dition for producing the perception of slant (97), but motion perspective is 
not (96). 

When do two-dimensional forms appear three-dimensional? Wallach & 
O’Connell (233) point out that no rules of ‘‘spontaneous organization”’ are 
adequate to predict this. Of course, the empiricists have their answer. In the 
case of moving forms, produced by turning a three-dimensional wire-edged 
or solid form behind a translucent screen so that only a two-dimensional 
shadow is seen by the observer, one essential condition for the perception 
of a third dimension is that the shadow must display contours or lines which 
change their length and their direction simultaneously. This is Wallach’s 
“kinetic depth effect.”” For any other form to appear three-dimensional there 
must have been previous perception of that form in three dimensions, these 
authors conclude. Fry (82) presents a general mathematical formulation of 
relations governing stereopsis and geometrical perspective for plane figures. 

Size and distance judgments are intimately related, and Gibson (94) 
makes a plea against trying to separate the two factors except for very lim- 
ited experimental purposes. Boring (23, 24) says that such analysis is the 
essence of scientific method in psychology. Through an ingenious use of re- 
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versible figures Hochberg & Hochberg (115) attempt to separate familiar 
size from relative size as determinants of perceived distance. That their ex- 
periment does so is open to question. Where they try to eliminate familiarity, 
they use the identical boy in two sizes for comparison figures. He is familiar 
by the time he is perceived on both surfaces. Kume’s method (133) seems to 
eliminate familiarity more adequately for he used disks which lack identity. 
Both experiments are said to show that larger objects are seen as nearer than 
smaller ones, where distance is ambiguous. 


VISIBILITY, DETECTION, ORIENTATION 


Problems of outdoor visibility are treated at length by Middleton (153). 
Many specialized topics in this area are currently under investigation. 
Examples are the Tousey & Koomen study (225) of the increasing visibility 
of stars during twilight and the research by Sexton, Malone & Farnsworth 
(203) of the visibility of different colors against a simulated sea background. 
The poor visibility of yellow came out in both studies, interestingly enough, 
since this has been thought the color of choice for life rafts. Orange Alde- 
baran appeared early in the night sky, relative to its magnitude. Red was 
more visible than other hues against the blue-grey of the ocean. Visual prob- 
lems to be expected in very high altitude flying are discussed by Cibis (56). 

Problems of lighting, legibility, and spatial arrangement in visual dis- 
plays are central in human engineering research. The new JES Lighting 
Handbook (118) is invaluable for psychologists concerned with this area. 
The problem of specifying luminance at mesopic and scotopic levels is re- 
viewed, and a luminance scale is offered for converting from cone to rod 
levels by a series of multiplying factors [Bridgman (32)]. Spragg & Rock 
(217, 218) find a critical brightness level of 0.02 ft. L. for dial reading, above 
which there is little or no improvement in speed or accuracy, and color is 
a minor factor. Illumination (not luminance) was the factor studied by 
Tinker (224) in an investigation concerned with reading six-point italic 
type. The critical level he reports was between 10 and 25 ft. candles. Fluores- 
cent flicker is found to reduce cff but not significantly affect reading per- 
formance (245). The legibility of letters and digits was studied by Crook, 
et al. (61) and Schapiro (198). Widths and spacing of markers and pointers 
in dial design for precision of reading were investigated by Carr & Garner 
(46). Christensen (53) finds interactions between type of dial, exposure 
time, and practice. Eriksen (73, 74) studied the general problem of locating 
specified objects within a multitudinous display. Hue proved more effective 
than any other single dimension in facilitating speed of the locating and 
designating response. Objects unique in several dimensions were still more 
quickly located. 

Solomon & Postman (214) report that frequency of experimental usage 
is a determinant of thresholds for tachistoscopic recognition for nonsense 
words. McGinnies, Comer & Lacey (150) found that word length and word 
frequency (familiarity) were, both separately and in interaction, significant 
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determiners of variation in recognition thresholds. Lacy, Lewinger & Adam- 
son (134) used instructions to manipulate thresholds for ‘‘taboo’”’ words. 
The highly familiar sequences, the alphabet, and numbers from 1 to 20, 
were used by Mowbray (163) to compare the efficiency of vision and audi- 
tion, singly or in competition, for the detection of missing elements. For the 
single modality audition was less efficient. Errors increased about equally in 
both modalities when two tasks were presented simultaneously, except that 
a significantly larger increase in errors of commission occurred with visually 
presented sequences. 

Orienting with respect to the gravitational vertical is dependent upon 
cooperation or conflict between visual and proprioceptive mechanisms. 
Where there is conflict the response may be a compromise one (95, 97, 148) 
or one modality may dominate (148). Instructions and the sophistication 
of S are variables which can influence the relative dominance of conflicting 
cues [Mann & Boring (149)]. Prior tilt of the test rod effects the perceived 
vertical [Werner & Wapner (239)]. Mann (147) studied the duration of the 
oculogyral illusion by using a flickering vertical line presented above critical 
fusion frequency to produce a ‘‘picket fence’ phase. The relation between 
revolutions per minute and duration fits Mayne’s formula. Visual cues 
utilized in orientation during flight in aircraft are discussed by Cibis (55) 
and Rose (194), who agree that size of image is an important cue, that 
binocular cues are relatively unimportant, and who are not in full agreement 
as to whether motion parallax or linear perspective should be considered the 
other cue of major importance. Gerathewohl (93) discusses the problem of 
predicting the disorienting effects that will be encountered under sub- 
gravity or zero-gravity states. Accidents on superhighways involving ve- 
hicles moving in the same direction at different rates of speed prompted the 
experiment by Stalder & Lauer (219). They investigated the effect of visual 
patterns on the perception of relative motion, and found that a sharply 
delineated area was superior to small light spots or checkerboard in produc- 
ing accurate estimates of the rates of approach. 

Emphasis on perceptual variables in eye-hand coordination is seen in a 
number of recent studies (77, 101, 184, 192, 197). Space does not permit 
discussion of this very important applied area of research. 


MoTIVATION IN VISUAL PERCEPTION 


Motivation was seen to be a minor but somewhat troublesome variable 
in simple. threshold studies (22). It is undoubtedly more significant in 
highly ambiguous conditions of stimulation and response. The literature in 
this area is still full of contradictions. Motivating conditions are too often 
inferred rather than controlled, although an increasing number of studies 
are dealing more effectively with this variable. Whether to regard a given 
change in response as a measure of a perceptual process or an alteration in 
response tendency as a result of a shift in ‘‘set’”’ is a difficult theoretical 
issue. Some progress is being made in the effort to disentangle this issue 
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experimentally [Postman & Crutchfield (183)]. Whether motivation operates 
directly at the moment of perception and response, or whether it has oper- 
ated exclusively in prior situations to produce selective learning is not always 
clear in a given experiment. The reader interested in following current 
theoretical issues in this area should consult Postman’s recent discussion 
(179). 

The concept of ‘‘perceptual defense” is under dispute. An experiment by 
Taylor (221), who conditioned the galvanic skin response to geometrical 
figures, supports the position that discrimination is possible before a visual 
stimulus can be identified verbally. Postman, Bronson & Gropper (180) 
find no support for the concept of ‘“‘perceptual defense’ as a variable influenc- 
ing thresholds for emotionally toned words, nor do Bitterman & Kniffin (21). 
Postman & Brown (181) were able to alter word recognition thresholds by 
manipulating the preceding success or failure experiences of the subjects. 
Their results are closely related to those cited earlier on the effects of fre- 
quency and expectancy (134, 150, 214). 

Fraser (81) used a new visual vigilance task in which the presence or 
absence of the experimenter produced a reliable difference in the number of 
signals seen, a difference contributed entirely by seven of the 18 Ss. The 
nature of motivational differences is merely implied. Gilchrist & Nesberg 
(100) varied hunger or thirst by imposing up to 20-hr. of food or 8-hr. of 
fluid deprivation. Positive time error in the illuminance matches of objects 
related to need increased with increases in the periods of deprivation. This 
is a variation of the experiments on changes in the magnitude of a psycho- 
physical dimension as related to symbols of need-reducing objects. Effects 
on the apparent size of objects are reported in experiments by Bruner & 
Rodrigues (38), Dukes & Bevan (71), and Singer (206). Hastings (109) 
used a modified ‘‘thereness-thatness” demonstration of Ames and reports 
that Ss rating high on insecurity tended to see objects nearer than did more 
secure subjects. 
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Auditory science continues its steady development in both its pure and 
applied aspects, and the past year has seen the publication of several impor- 
tant summaries that will help psychologists keep abreast with the advances 
in the knowledge about hearing. The recent Handbook of Experimental Psy- 
chology (96) continues to be the best single source for a detailed presentation 
of the facts and concepts of audition. Geldard (38) has provided us with a new 
general treatment of the senses, and the wealth of information organized by 
him shows how mature the study of sensation has become. Hirsh’s book (49) 
presents the basic data and the fundamental concepts of selected areas of 
psychoacoustics and then shows how this material is related to the measure- 
ment of deafness. His discussions of auditory fatigue and recruitment are 
particularly welcomed, since these important topics have a widely scattered 
literature. For those psychologists interested in deafness and related prob- 
lems, the quadrennial review of Progress in Ophthalmology and Otolaryngology 
(56) will be helpful. A collection of the papers read at the First International 
Course in Audiology held at Stockholm in September, 1950, constitutes two 
separate volumes of Acta Oto-Laryngologica (1). Taken as a group these paper 
provide an excellent source book for studies in audiology. 

Fletcher’s comprehensive monograph (29) in some 450 pages reports 
primarily the extensive researches conducted by investigators at the Bell 
Telephone Laboratories over the past 35 years. The concept of the critical 
band and its relation to hearing functions, the computation of the articula- 
tion index from a few physical and psychophysical parameters, Fletcher’s 
space-time pattern theory of hearing, and many other facts and principles 
established since the first edition of 1929 are included. It is indeed fortunate 
that Dr. Fletcher has brought together in a single volume the monumental 
contributions made at the Bell Telephone Laboratories while he was Director 
of Acoustical Research. 


ANATOMY AND PHYSIOLOGY 


The fundamental advances toward our understanding of the mechanical 
and electrical dynamics of the cochlea have been crystallized in the splendid 


1 The survey of the literature to which this review pertains was completed in 
June, 1953. 

2 The following abbreviations were used in this review: DL (difference limen); 
SPL (sound pressure level); j.n.d. (just noticeable difference); MCL (most comfort- 
able listening level). 

3 The writer acknowledges the aid of Mr. Edward C. Carterette in the preparation 
of this review. 
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review by Licklider (65). Thus, without slighting this important area of 
work, we shall examine only a few articles dealing with the anatomy and 
physiology of hearing. 

Anatomy of the cochlea.—New techniques are being applied in a concerted 
effort to establish information on the detailed structure of the cochlear par- 
tition and the innervation of the organ of Corti. It is more evident than ever 
that a clear and final understanding of the events that transpire between the 
motion of the tympanic membrane and the discharge of the auditory nerve 
must rest upon the facts of the function and morphology of the observed 
physical elements themselves. 

Wolff (113) provides us with a more complete evaluation of the superb 
work of Alfonso Corti (1851) and includes some of her own beautiful photo- 
graphs on the microscopic anatomy of the cochlea. From her observations 
she believes that the tectorial membrane is attached at the so-called Hensen’s 
node to the inner hair cells, and she expresses no doubt that the outer hair 
cells are imbedded in the tectorial membrane. 

Katsuki & Covell (59) studied the organ of Corti under phase contrast 
microscopy. They throw doubt upon the function of Hensen’s cells as sup- 
porting elements for the organ of Corti, provide further evidence that the 
hair cells are the most fragile components of the organ of Corti, and agree 
with Hilding’s description of the tectorial membrane. 

Engstrém & Wersill (25) are studying by modern electron microscopical 
methods the structure of the outer hair cells of the inner ear. 

For a very long time there has been controversy over the question as to 
whether the tectorial membrane is attached to the organ of Corti. The alter- 
ations produced by dissection and fixation have made it difficult to determine 
the precise microanatomy of the ear. Hilding (48) dissected some 130 tem- 
poral bones, mostly fresh, about 100 of which were from man. Whenever 
feasible, dissection was made immediately without fixation, and staining 
was done in situ with simple stains. According to his observations the tec- 
torial membrane is inserted upon the organ of Corti and is in contact with the 
hairs of the hair cells. In certain preparations, the basilar membrane and the 
organ of Corti were made to rise and fall and the tectorial membrane rode 
upon the organ of Corti without apparent shearing motion at right angles to 
the long axis of the cochlear partition. Certain important properties of the 
tectorial membrane were determined. The following theory was then pro- 
posed. Because the fibrils of the tectorial membrane are almost parallel and 
are directed toward the apex and outwardly, and because the tectorial mem- 
brane is nonelastic in the longitudinal direction but is elastic in the transverse 
direction, an extended movement of the tectorial and the basilar membranes 
will nevertheless result in a shearing force between these two membranes at 
only a small locus of flexure as the travelling wave proceeds down the coch- 
lea. If Hilding’s model is correct, it provides a mechanical sharpening mech- 
anism for frequency resolution by the cochlea. [See (65, pp. 89-92).] 

Certain dimensions of the cochlea are important for understanding its 
mechanical and electrical action. Fernandez (26) provides data for the guinea 
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pig similar to those published by Wever (108) for man. Some of the dimen- 
sions plotted as a function of the distance along the basilar membrane for 
the cochlea of the guinea pig are: the distance across Rosenthal’s canal and 
the osseous spiral lamina, the width and thickness of the basilar membrane, 
the cross sectional area of the three channels of the cochlea, and the angles 
formed by the axes of the outer and inner hair cells with one another and 
with the basilar membrane. 

Spatial origin of the cochlear microphonic.—Tasaki & Fernandez (99) de- 
veloped techniques by which they could (a) record simultaneously electrical 
responses from different turns of the guinea pig cochlea, (b) perfuse the 
cochlea with chemicals (potassium chloride), and (c) polarize the sensory 
endings in the cochlea by D.C. Very fine electrodes were introduced into a 
given turn of the cochlea, one into the scala vestibuli and the other into the 
scala tympani (differential recording); great care was taken to avoid injury 
to the scala media. Reversible changes in the magnitude of the cochlear 
microphonic and nerve potentials were induced with a perfusion of the canals 
with a potassium chloride solution, or by a polarizing D.C. across the coch- 
lear partition. The potentials recorded were shown to arise from activity in 
the neighborhood of the electrodes and to be independent of the cochlear 
microphonic originating in other turns of the cochlea. It was shown that the 
cochlear microphonic is generated in the basal turn in response to both low 
and high frequencies, whereas the third and apical turns respond only to low 
frequencies. Previous investigators who have recorded the aural microphonics 
as a function of the stimulating frequency with one electrode on the round 
window and the other on the neck evidently were not obtaining information 
about the activity of the cochlea in the upper turns. Obviously, this is not to 
say that the electrical activity recorded at the round window is uninforma- 
tive, for example, see Heise & Rosenblith (46). 

Neural activity of the medial geniculate body—Galambos et al. (33, 35, 
87) present in detail the results of their meticulous exploration with micro- 
electrodes of the potentials whose origin was shown to be in the neural struc- 
tures of the pars principalis of the medial geniculate body in cats. Of their 
major findings we mention only those concerned with the latency, the in- 
tensity, suppression (masking), and the tonotopic organization of the neural 
activity of the medial geniculate body. (a) Latency: the first sign of electrical 
activity after an acoustic click occurs at about 6 msec. with a small range of 
latencies, and spike potentials arise at about 10 msec. It is estimated that 
the major afferent volley to the medial geniculate in response to a sharp click 
is temporally dispersed about 10 times by the acoustic pathway. (b) Intensity 
of acoustic click: As stimulus intensity increases, the latency decreases and 
the percentage of clicks that evoke spikes rises from zero to a maximum (80 
to 100 per cent) in about 20 db. (c) Suppression: At the medial geniculate, 
one of the most striking effects of pure tone stimuli is their suppression of the 
electrical response due to a click. (d): Tonotopic organization: Pure tone 
audiograms on medial geniculate units indicate that the typical element re- 
sponds to a relatively wide range of frequencies at its threshold intensity. 
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The range of effective frequencies becomes even wider as the stimulus in- 
tensity is increased. These latter results are not in accord with data from 
other lines of investigation which have indicated tonotopic organization. 

Hearing and cochlear lesions Schuknecht & Neff (91) provide new infor- 
mation as to the effect upon hearing in cats of surgical insult to the receptor 
and neural elements in the apical region of the cochlea. The absolute thresh- 
olds for pure tones were determined by avoidance conditioning both before 
and after the cochlea was entered with a very small needle. The cochleas 
were examined histologically. When damage to the hair cells was confined to 
the apical region, audiograms showed that the principal hearing loss was be- 
low 1000 c.p.s. As the damage became more extensive, sensitivity at the 
higher frequencies was also diminished. 

Hearing and cortical lesions —Further evidence on the complex organiza- 
tion of the different areas of the auditory cortex involved in frequency and 
intensity discrimination was published by Meyer & Woolsey (76). The orig- 
inal article must be consulted to appreciate the high caliber of this research, 
and only the principal results can be given here. Cats were trained to respond 
to a small change in the frequency (in some cases, intensity) of a pure tone of 
1000 c.p.s. Then, bilaterally symmetrical cortical ablations of various com- 
binations of auditory area I, auditory area II, somatic area II, and the pos- 
terior ectosylvian gyrus were made. Retraining was then undertaken. The 
results show that it is possible, through surgical intervention, to dissociate 
frequency from intensity discrimination. Removal of all the combined areas 
listed above abolished frequency discrimination but left intensity discrim- 
ination relatively unimpaired. Ablations of less than all these areas did not 
result in quantitative deterioration of frequency discrimination. 

Toxic effects of streptomycin.—Large doses of streptomycin in its present 
form result in damage to the sensory epithelium of the vestibular and audi- 
tory apparatus. Space permits only references (10, 43, 44, 90, 102). 


PSYCHOPHYSICS OF THRESHOLDS AND ATTRIBUTES 


Absolute sensitivity and Brownian motion.—Under ideal conditions for 
the detection of a signal, certain of our sense organs respond to an incredibly 
small amount of energy. In fact, the random fluctuations of pressure on a 
membrane as a result of Brownian noise in the air is not far below the pressure 
required at the absolute threshold of hearing at 2000 c.p.s. De Vries (22) 
points out that in the consideration of the energetics of the threshold, the 
important quantity is the Brownian motion of the eardrum, because this 
membrane is not only acted upon by Brownian noise of the air, but it is 
coupled to a system (the middle and inner ear) that generates Brownian 
noise. His calculations show that Brownian motion at the eardrum is above 
the measured threshold. He then reasons that since the threshold is deter- 
mined in the presence of Brownian noise, the actual threshold energy is the 
sum of the measured stimulus energy and the Brownian motion. De Vries 
then goes on to consider the Brownian motion in the inner ear, and his calcu- 
lations show that at threshold the power from the stimulus available to a 
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single hair cell (if not amplified) is far below that attributable to the Brown- 
ian motion in the cells. The observations by Davis et al. (18) and by Békésy 
(5, 6) establish the existence of D.C. potentials across and within the coch- 
lear partition which are maintained continuously by the metabolism of the 
sensory cells. Amplification of the stimulus energy is thus possible, and the 
energy available for excitation of the nerves may greatly exceed the energy 
from the acoustic stimulus. De Vries holds that electrically the cells are 
rigidly coupled in parallel so that the electrical effects of Brownian motion 
of an individual cell are distributed and therefore suppressed. On the other 
hand, a stimulus moves a number of cells in phase and the potentials gen- 
erated act together. 

The audibility curve-——The accurate calibration of audiometers requires 
that the procedure for measuring the threshold of hearing in terms of mini- 
mum audible pressure be standardized, and that a large group of normal list- 
eners be tested by this procedure over the entire audible range. Dadson & 
King (17) report on their progress in the standardization of audiometers and 
procedures for determining the absolute threshold. It is hoped that such data 
will establish a standard reference zero based on the normal ear. Their re- 
sults are in excellent agreement with those reported below by Wheeler & 
Dickson, and in substantial agreement over the range 250 to 8000 c.p.s. with 
the results of Sivian & White for minimum audible pressure (94). Dadson & 
King’s data show greater sensitivity outside this range. Histograms at each 
frequency tested show how the absolute thresholds are distributed. Wheeler 
& Dickson (109) considered carefully the factors that are important in the 
standardization of audiometry. Their subjects were males with normal ears 
between the ages of 18 and 23. The precision measurements were made from 
256 to 8192 c.p.s., and frequency distributions of absolute thresholds based 
on 1028 ears are shown graphically. 

Heise (45) determined for the pigeon the absolute thresholds for pure 
tones and for white noise by an operant conditioning technique. Sensitivity 
to pure tones was similar to that of human subjects at low frequencies, but 
the thresholds for the pigeons increased abruptly above 4000 c.p.s. 

Free-field versus ear canal pressures: the missing 6 db.—In a series of ex- 
periments, Munson & Wiener (79) have tried to resolve the mystery of the 
missing 6 db (9, 89, 94). The problem is simply this: at low frequencies, be- 
low about 300 c.p.s., an earphone must generate a greater sound pressure in 
the ear canal than a free-field sound pressure that has the same sensory effect. 
The explanation of this difference is difficult because the human head is very 
small compared to the wave lengths of these low frequencies, and therefore 
diffraction effects are negligible (110, 111). Furthermore, even when the 
sound pressure is measured in the ear canal for the two conditions of listen- 
ing, there is the discrepancy of at least 6 db in order to produce the same re- 
sponse. From Thévenin’s theorem one would expect that the single ear would 
not care whether the sound pressure was generated by an earphone or by a 
loudspeaker. But somehow the listener takes the difference in the source of 
the sound into account and thereby requires a difference in intensity of about 
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6 db in judging the two sounds equivalent. Now, at the quiet absolute thresh- 
old this difference in the energy required between listening to an earphone 
and to a loudspeaker might be the result of the masking effect of physio- 
logical noises, such as the rush of blood and the pounding of the heart, or of 
the seashell effect created by the presence of the earphone and muscle tremor 
(13). But at higher intensities of the stimulus these masking noises are negli- 
gible, and equal loudness matches between a tone from an earphone and a 
tone of the same frequency from a loudspeaker end up with 6 db missing. 

In addition to (a) the hypothesis of masking by self-noise, Munson & 
Wiener consider the following factors that might be responsible for the dis- 
crepancy at higher stimulus intensities. (b) The hypothesis of poor receiver 
fit—that the low frequency leak was less during the measurements of the 
sound pressure by the use of a probe tube than during the loudness balance. 
But in their new experiments the sound pressure was measured during the 
progress of the loudness balance, and the hypothesis discarded because the 
pressure difference was still about 6 db. (c) That the static pressure attribut- 
able to the application of the earphone to the ear might be responsible was 
abandoned because they found that this static pressure diminishes rapidly 
to less than 1 mm. of mercury during a loudness test. (d) The middle ear 
muscles might contract when an earphone is applied to the pinna. Their ex- 
periments on this point make it an unlikely factor. 

Munson & Wiener also report the following experiment. Earphones are 
worn throughout a series of loudness balances. The listener matches in 
loudness a tone generated by the earphones to a tone of the same frequency 
that is generated by a loudspeaker but which reaches the receptors by leaking 
under the receiver caps or by any other available path. Probe tubes (78), in- 
serted into the ear canals with the earphones on the ears, measure the sound 
pressures during the loudness balance. In one experiment, no significant dif- 
ferences were found between the sound pressures from the two sources. But 
in a later experiment the sound pressure under the earphone was again about 
6 db greater when the sound was coming from the earphone than when the 
tone was leaking into the ear canal from the loudspeaker. It is probably im- 
portant that the experimenters could not conceal from the listener the source 
of the sound. This last experiment and its replication with their disconcerting 
results still leave us with a baffling problem. 

Related to the problem just discussed are Benson’s (8) cautions regarding 
the calibration and use of probe-tube microphones. 

Masking.—Webster et al. (107) secured the masked absolute thresholds 
for pure tones in the presence of a thermal noise that had the energy rejected 
over part of the frequency scale. For a noise with a gap that is one octave in 
width, a sinusoidal signal at the center of the gap is just audible when its 
intensity is about 25 db less than that required in the presence of unfiltered 
noise. These results are of practical interest, because a steady noise with a gap 
in it might be used to make transient, extraneous noises less distracting with- 
out reducing the audibility of the signal. They also measured the change in 
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pitch of pure tones near the abrupt slopes in the noise spectrum, and their 
results agreed substantially with earlier findings. 

Frequency discrimination—The arduous task of measuring the smallest 
change in frequency or intensity that can be detected by the human listener 
needs to be undertaken from time to time. Our techniques improve, and our 
understanding of auditory discrimination enables us to define more precisely 
just how the data should be secured. For over 20 years the data of Shower & 
Biddulph (93) have been used to define the resolving power of the ear with 
respect to frequency. These investigators employed a form of frequency 
modulation, and it has never been clear that their results were obtained under 
stimulus conditions that exhaust the listener’s capacity for frequency dis- 
crimination. Harris (41) has given the questions of stimulus control, psycho- 
physical method, and training of subjects careful consideration and has pro- 
vided us with new data on pitch discrimination at various frequencies from 
60 to 4000 c.p.s. Below 1000 c.p.s., his results differ substantially from Shower 
& Biddulph’s, and it seems clear that different auditory functions are meas- 
ured by the two studies being discussed. The present writer would maintain 
that Harris’ stimulus conditions are preferred over Shower & Biddulph’s for 
the purpose of defining pitch discrimination. 

Auditory flutter —Pollack (83) furthered our understanding of auditory 
flutter (a) by measuring thresholds of detection of changes in the rate of in- 
terruption of a white noise, (b) by constructing a psychological scale of audi- 
tory flutter, and (c) by determining the accuracy of estimation of the rate of 
interruption. With this information at hand, Pollack considered the question 
of whether the neural mechanism underlying the perception of auditory 
flutter is substitutive or additive. Since the subjective size of the j.n.d.? of 
auditory flutter is roughly proportional to the number of j.n.d.’s above thresh- 
old, he concludes that the neural mechanism for the perception of a period- 
ically interrupted white noise is more like that of loudness (additive) than 
like that of pitch (substitutive). 

Loudness of noise——Pollack (84) determined equal loudness contours for 
bands of noise and found that these contours are closely similar to those for 
pure tones. He measured also the contribution to the loudness of noise made 
by various frequency bands over a wide range of noise levels. Near the quiet 
absolute threshold, the stimulus is slightly more effective if the noise energy 
is concentrated in a narrow frequency band than if it is spread over a wider 
band. However, above threshold, a noise of constant band pressure level 
(constant overall S.P.L.?) is louder if the energy is scattered along the fre- 
quency scale than if it is concentrated. Pollack interprets his findings by em- 
ploying the assumption that the displacement of the basilar membrane is a 
nonlinear function of the mechanical excitation. He assumes that a narrow 
band of noise quickly overloads the system, whereas, with a wider band of 
noise, frequency analysis of the cochlea results in less displacement and less 
overload at any given point. 

Another explanation might be, however, that the overload or nonlinearity 
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comes about from the distribution of thresholds of elements in the various 
neural channels. The all-or-none pulses in the nerve fibers that represent the 
energy concentrated in a given frequency region of the acoustic stimulus 
soon reach a maximal rate (34), and it might be that the spreading of the ex- 
citation along the basilar membrane brings in more responding fibers than 
does an increase in intensity at a given locus. This latter interpretation does 
not assume mechanical overload until more intense levels of excitation are 
reached. It is here offered as an alternative interpretation to that of Pollack’s 
because he found that the weakest intensity at which it is more effective to 
spread the noise frequencies than to concentrate them is far below the inten- 
sity at which the cochlear microphonic begins to show overload. 

Volume of noise—Thomas (101) extended his earlier investigation of the 
volume of pure tones (100) to the volume of bands of noise. Bands of noise 
of various widths were compared with a narrow, standard band with respect 
to both loudness and volume. The equal-loudness equations differed system- 
atically from those of equal volume. As a band of noise is increased in width, 
its band pressure level must be decreased in order to maintain equal loudness, 
and then it must be further decreased in intensity in order to maintain equal 
volume with the narrow, standard band. The results based on the loudness 
matches agree substantially with those obtained by Pollack (84). 

Three articles (37, 57, 106) on the effect of interaural phase differences on 
the masked thresholds and the perception of pure tones must be left for a 
later reviewer. 

AUDITORY FATIGUE 


Auditory fatigue does not refer to a specific state or condition of the end- 
organ and nerves. The defining operation merely consists in a measured in- 
crease in the quiet absolute threshold following stimulation. Auditory fa- 
tigue, defined in this way, can be obtained under a wide range of stimulus 
conditions, but it now appears unlikely that the mechanisms responsible for 
the threshold changes are the same for the various conditions. Zwislocki & 
Pirodda (114) provide a convincing argument for distinguishing between 
adaptation (short-duration auditory fatigue or residual masking) and tran- 
sient acoustic trauma (temporary hearing loss or stimulation deafness). If 
the threshold shift is measured at a number of frequencies following exposure 
to a pure tone, a curve showing the spread of adaptation or fatigue is obtained. 
Their experimental analysis indicates that adaptation has the following 
characteristics: The maximum amount of adaptation occurs at the frequency 
of the fatiguing pure tone and is even found after stimulation with very weak 
intensities. Furthermore, immediately (within 50 msec.) after cessation of 
intense stimulation with a pure tone, the greatest threshold shift occurs at 
the frequency of the fatiguing stimulus. Transient acoustic trauma, as the 
authors term it, is greatest about one half octave above the fatiguing stimu- 
lus. It is evident only after relatively intense stimulation and after a rest 
period. The acoustic trauma persists much longer than the threshold shift 
resulting from adaptation. Zwislocki & Pirodda conclude that adaptation de- 
pends upon a decrease in the sensitivity of the sensory cells and of the pe- 
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ripheral nerve fibers, whereas transient acoustic trauma has its locus in the 
sensory cells. 

Since complete recovery is obtained even with severe temporary hearing 
losses, it is not yet clear whether permanent acoustic trauma is an extension 
of the same process as transient acoustic trauma, or is a different type of 
injury, such as an anatomical dislocation of the hair cells. 

Rawnsley & Harris (85) have continued their studies of short-duration 
auditory fatigue. In the present study they investigated the effect of the in- 
tensity of the fatiguing stimulus on the degree of fatigue with the recovery 
interval as a parameter. With short intervals of time between the cessation 
of the fatiguing stimulus and the onset of the test tone, fatigue is demon- 
strable after stimulation for only 400 msec. at a sensation level of only 10 
db. From such data it is possible to plot the course of recovery from auditory 
fatigue. Their data show that at low fatiguing intensities the amount of 
fatigue (threshold shift) is independent of the duration of the fatiguing stim- 
ulus, and this result is in support of the contention of Zwislocki & Pirodda 
that short-duration auditory fatigue is a different process from transient 
acoustic trauma. 

Rawnsley & Harris (86) attempted to determine the ‘‘tuning”’ of the 
cochlea by measuring in considerable detail the spread of fatigue (or residual 
masking) to frequencies closely adjacent to the fatiguing tone. The fatiguing 
tone was sounded for 435 msec. After a silent interval of 35 msec., the test 
tone (50 msec. in duration) was presented. They found nearly uniform fa- 
tigue from 990 through 1010 c.p.s. following exposure to a fatiguing stimulus 
of 1000 c.p.s. However, since the tone used to test for a threshold shift was 
of very short duration, it consisted of a narrow band of energy. If the cochlea 
had infinite resolving power, the curves relating fatigue to the center fre- 
quency of such a test stimulus would have a broad maximum about as wide 
as that of the test stimulus. Hence, the authors should not be disturbed by 
the lack of sharp “‘tuning’’ as measured by their technique. 

Hirsh & Ward (51) studied the recovery of the absolute threshold after 
strong acoustical stimulation. The most important aspect of their results is 
that the recovery from fatigue is not always a simple monotonic function, 
but is in many cases diphasic. A typical diphasic recovery curve shows an 
initial rapid recovery occupying about 1 min., with a subsequent rise in the 
threshold followed by its slow return to normal. Such recovery curves are 
most apparent when the fatiguing stimulus is of low frequency, especially 
500 c.p.s. In the first experiments a click was used to trace the course of re- 
covery from fatigue. Since this click had a broad spectrum, the recovery from 
fatigue was also investigated by measuring the absolute threshold for bands 
of noise each 250 mels wide, and for pure tones. With a fatiguing stimulus of 
500 c.p.s., 120 db S.P.L., and 3 min. of stimulation, it was found that the 
bounce or diphasic character of the recovery curve shows up most clearly 
for bands of noise, and for pure tones, between 1000 and 5000 c.p.s. Many of 
their diphasic recovery curves show that the absolute threshold at the maxi- 
mum of the bounce is greater than the threshold measured immediately after 
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cessation of the fatiguing stimulus. Wide individual differences are reported 
as to the degree of the bounce in the recovery curve, but its consistent ap- 
pearance for certain subjects upon repetition of the same conditions estab- 
lishes that the process of recovery from fatigue is in these cases complex. 
Although there were important differences between the procedures used by 
Hood (52) and those used in this study, nevertheless it is surprising that 
Hood did not record diphasic recovery curves. Only further research can 
resolve this discrepancy. 


SENSORI-NEURAL DEAFNESS 


Until recently, it was accepted doctrine that peripheral deafness, as op- 
posed to central deafness, could be classified into two distinct types. Pure 
conductive deafness, or transmission-deafness, results from disturbances in 
the mechanical transmission system. In the treatment of transmission-deaf- 
ness, the hearing loss can be successfully considered as identical with the 
insertion of an equivalent attenuation in a communication circuit terminated 
by the normal ear. Nerve or “perceptive” deafness results from sensori-neural 
dysfunction. In many cases of nerve deafness, the resulting hearing loss can- 
not be compensated for by an increase in the gain of a linear communication 
circuit, because the rate of change of loudness with respect to intensity is 
greater in these cases than for the normal listener. For over 15 years, the 
accepted explanation of this phenomenon of loudness recruitment has been 
in terms of the loss of sensori-neural units that become an increasingly smaller 
percentage of the total number of activated fibers as stimulus intensity in- 
creases. Injury to the hair cells has been considered physiologically equiva- 
lent to injury to the VIIIth cranial nerve (97, p. 288). 

The recent studies, principally from the clinic, may force us to abandon 
this simple theory of recruitment and to recognize three types of peripheral 
deafness: transmission-deafness, cochlear or scalar deafness, and VIIIth 
nerve or neural deafness. At the same time these investigations may provide 
psychophysical tests that will help the clinician to differentiate a lesion or 
malfunction in the hair cells from damage to the VII Ith nerve. 

The distinction between these three possible type of peripheral deafness 
revolves about the phenomenon of loudness recruitment. Transmission-deaf- 
ness may be distinguished from sensori-neural deafness by a comparison of 
the absolute thresholds obtained by bone and by air conduction. Theoreti- 
cally, measures of recruitment are unnecessary to establish whether or not the 
increased threshold of hearing is a result of transmission loss or to hair cell 
and nerve injury. However, the phenomenon of recruitment appears at the 
moment to provide a means of distinguishing scalar from VIIIth nerve dis- 
turbances. 

It now seems that loudness recruitment is correlated with cochlear or 
scalar disturbances, especially with damage to the hair cells, whereas in 
VIIIth nerve injury recruitment is partial or absent. Most of the cases to 
be reported of cochlear disturbances with probable hair cell involvement are 
those with Méniére’s disease, in which the cardinal symptoms are vertigo, 
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tinnitus, and deafness. In their fundamental research, Dix, Hallpike & 
Hood (23) state that the structural changes in the inner ear found in Mé- 
niére’s disease are limited to Corti’s organ, and that the nerve fibers and cells 
of the spiral ganglion are normal. In their 30 cases with Méniére’s disease, 
loudness recruitment was present and complete. 

In order to study the effect on hearing of lesions in the VIIIth nerve, 
patients with tumors of the cerebellopontine angle have been given various 
tests, both direct and indirect, of recruitment. Dix et al. (23) reported that 
recruitment was absent in 14 of their 20 cases of VIIIth nerve degeneration. 

Because of the great importance of these phenomena, both in their theo- 
retical and clinical aspects, several papers dealing with this topic will be re- 
viewed in detail and at least brief comment devoted to several others. Im- 
portant papers not reviewed here are Flett (30), Fowler & Zeckel (31), Fur- 
stenberg (32), Lempert et al. (64), Lindsay (66), Lindsay et al. (67), McKen- 
zie (74), and Williams (112). These studies have appeared over the past two 
years. 

Measures of recruitment.—The direct measure of recruitment in terms of 
loudness balances is tedious, time-consuming, and not always possible. Many 
other measures which might indicate recruitment have been explored and sev- 
eral found valuable. The ensuing discussion will be clarified by a summary of 
the principal measures of recruitment which show characteristic differences 
in the two types of deafness (see Table I). For a more complete list and dis- 
cussion of these measures, see Garner (36), Hirsh (49), and Harris (42). 

The new clinical evidence—Eby & Williams (24) employed Fowler’s 
“alternate binaural loudness balance test,’’ in the study of (a) 32 patients 
with unilateral deafness in which the diagnosis was Méniére’s disease (end- 
organ lesion), (b) one case with unilateral deafness considered to be the result 
of a vascular accident involving the organ of Corti, and (c) 16 patients with 
unilateral auditory nerve deafness attributable to a variety of causes. 

Consider first the 33 cases whose deafness was presumably a result of 
lesions of the end-organ. Loudness recruitment was present in all cases and 
was incomplete in only two of the 33; diplacusis binauralis dysharmonica 
was present in 23 of the 27 cases tested. In 15 of the 17 tested for speech in- 
telligibility with phonetically balanced lists, the per cent correctly received 
reached a maximum and then declined. The range between the threshold for 
speech reception and the most comfortable listening level (MCL)? was nar- 
rowed to 18 db. The threshold of discomfort for speech was within the limits 
of their audiometric scale in all cases tested. 

In eight of the 16 cases with auditory nerve deafness, the diagnosis of 
nerve fiber involvement was substantiated by the operative findings. For 
these eight cases, VIIIth nerve tumors and lesions in the cerebellopontine 
angle were found. Loudness recruitment was not present for these eight 
cases, and only partial recruitment occurred in three of the remaining eight 
patients. Diplacusis binauralis was not found in 15 of the 16 tested. In five 
of the six cases given speech tests, the articulation scores continued to im- 
prove up to the limit of the intensity of the audiometric scale. The MCL 
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THE TyPIcAL RESULTS OF MEASURES OF RECRUITMENT THAT SEEM TO DISTINGUISH 
SCALAR FROM VIIITH NERVE DEAFNESS 








Measure 


Scalar Deafness 


VIIIth Nerve Deafness 





Binaural loudness bal- 
ance 


Rapid growth of loud- 
ness: recruitment 


Normal growth of loud- 
ness: no recruitment 





Equal-loudness contour 


Near normal at high in- 
tensities 


Same aberrant contour as 
shown at threshold 





Variability of absolute 
threshold (Békésy audio- 
gram) 


Small variability at fre- 
quencies which show re- 
cruitment 


Normal variability 





Difference limen for in- 
tensity near threshold 


Same size as at high sen- 
sation levels 


Same as normal ear at 
low sensation levels 





Short-duration auditory 
fatigue 


Greater fatigue corre- 
sponding to greater loud- 
ness 


Amount of fatigue same 
as normal ear with same 
sensation levels 





Diplacusis binauralis 


Marked pitch distortion 


Little pitch distortion 





Hearing loss for speech 


Not discriminative 


Not discriminative 





Discrimination loss for 
speech 


Marked loss, which in- 
creases with increasing 
intensity 


Loss relatively constant 
as intensity increases 





was wider for this group with nerve fiber deafness than for the 33 with end- 
organ damage. It averaged 29 db. The threshold of discomfort was beyond the 
limit of the audiometric scale. 

This study with its excellent case material gives clear support to the 
thesis that scalar deafness has different effects on hearing from neural deaf- 


ness. 


Liischer (71) in a paper based on extensive experience in the detection of 





recruitment by measurement of the DL? for intensity further documented 
the thesis that the presence of recruitment generally indicates a cochlear 
lesion. Considerable evidence is added to previous data that show the iden- 
tical behavior of recruitment and the DL for intensity. In another paper 


‘ It must be stressed that this Table is presented primarily to contrast the (prob- 
able) differences between two types of sensorineural deafness. Cases showing incon- 
sistent results have been reported which are, perhaps, attributable to mixed deafness, 
but the data do not permit any final conclusions. Of those measures listed, short- 
duration auditory fatigue is the least documented for the differential diagnosis of 
scalar and VIIIth nerve deafness. 
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Liischer & Zwislocki (72) compared the results of their technique, which 
employs amplitude modulation for measuring the DL for intensity, with that 
used by Denes & Naunton (21), and prefer the use of amplitude modulation 
in the detection of recruitment. 

Jerger (58) simplified Liischer’s technique for the determination of the 
DL for intensity, standardized his procedure on 39 normal ears and then ad- 
ministered this test to 89 cases showing hearing losses. Of these 89, 15 were 
pure conductive cases who showed DLs of the same magnitude as the normal 
ear. The remaining 74 cases comprised the group showing a sensori-neural 
loss of which 53 had abnormally small DLs, thus indicating recruitment. The 
otological diagnosis, made independently of Jerger’s tests, support the con- 
tention that the DL is small in acoustic trauma, Méniére’s disease, and en- 
dogenous deafness. Normal DLs were found in cases diagnosed as cerebello- 
pontine angle tumor, streptomycin damage, and disorders of the central 
nervous system. 

Lund-Iversen (70) determined the DL for 10 normal and 71 deaf individ- 
uals. These measurements apparently were made with the stimulus 40 db 
above the patient’s threshold, and no differences in the size of the DLs were 
found. It would seem that the DL for intensity will indicate recruitment best 
if the measurement is carried out at low sensation levels. At high intensities 
the DL for the recruiting ear will be nearly the same as for the normal ear. 

In three of the above studies (58, 70, 71), abnormally large DLs were re- 
ported in a few cases. There seems to be agreement that these large DLs have 
no peripheral organic basis for the indicated hearing loss. 

Hood (53), and Hallpike & Hood (40), have continued their studies of 
auditory fatigue and its relationship to clinical tests of deafness. It will be 
recalled (52) that in Hood’s study of perstimulatory auditory fatigue, a 
loudness balance was made with the two ears simultaneously stimulated, 
but with the fatiguing tone having been on for a period of time. The con- 
siderable degree of fatigue measured in this way contrasts with the absence 
of fatigue measured by matching two tones alternately presented, one to 
each ear. Hood explains the difference in the results obtained by these two 
methods in terms of the on-effect. In their more recent reports (40, 53), the 
auditory fatigue of normal and deaf individuals was determined by continu- 
ous and simultaneous stimulation of both ears. Of course, in the normal 
listener, no fatigue is demonstrable by this technique, since one ear fatigues 
as rapidly as the other. In unilateral transmission-deafness, as fatigue pro- 
ceeds, no change in stimulus intensity in the normal ear is required to bal- 
ance the loudness of the tone in the deaf ear. The initial loudness balance 
introduces a difference in intensity between the two ears such that the me- 
chanical excitation in the two cochleas is the same. From then on, fatigue 
proceeds in the ear with transmission-deafness at the same rate as in the 
normal ear. A very different result is obtained when this procedure in em- 
ployed in cases of unilateral Méniére’s disease. Here, the difference is abso- 
lute thresholds between the normal and impaired ear still shows up in loud- 
ness balances made at low stimulus intensities. But, as the two tones con- 
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tinue to sound, one in each ear, fatigue proceeds more rapidly in the impaired 
ear than in the normal, and the intensity in the normal ear must be decreased 
for a loudness balance. When high stimulus levels are employed, only a 
small difference in intensity is initially required for a loudness balance, and 
this fact is recruitment in the impaired ear. However, as stimulation continues 
the final balancing intensity in the normal ear must be reduced so much that 
little or no recruitment is manifest. They conclude that over a period of con- 
tinuous stimulation the phenomenon of loudness recruitment in Méniére’s 
disease is transient, and that in the adapted ear the impairment is still meas- 
ureable at high stimulus intensities. 

Pirodda & Zwislocki (82) continued the study of short-duration auditory 
fatigue as a measure of the recruitment phenomenon. They conclude that 
the measurement of fatigue over a considerable range of frequencies is ne- 
cessary for adequate demonstration of the presence of recruitment. They be- 
lieve that this testing technique is probably too protracted for practical pur- 
poses of routine testing. 

In a detailed and thorough monograph, Lundborg (69) evaluated the 
variability of the absolute threshold (Békésy audiograms) obtained on 72 
cases with different types of sensori-neural deafness. In scalar lesions or end- 
organ deafness (acoustic trauma or Méniére’s disease), the Békésy audio- 
gram indicated complete recruitment, while in retrocochlear lesions of the 
auditory pathways (intracranial tumor) the majority of the audiograms indi- 
cated absence of recruitment. 

Kristensen & Jepsen (61) also stress the importance of determining the 
presence or absence of recruitment in the differential diagnosis between 
Méniére’s disease and acoustic neurinoma. 

The numerous investigations concerned with loudness recruitment make 
it clear how interrelated are the various functions of hearing; for example, 
Bocca (11) has described a new test for the diagnosis of various types of 
peripheral deafness. The intensity of a temporally continuous noise is in- 
creased until it just masks a tone of 15 db (sensation level). The noise is then 
interrupted at a rate of one per second and the subject can usually again hear 
the tone. The rate of interruption is increased without altering the burst 
intensity until the tone is again just masked (see also, 77). This critical rate 
of interruption turns out to be the same in cases of conduction and VIIIth 
nerve deafness as in the normal. In the cases of cochlear deafness, but with 
the exception of those with Méniére’s disease, the critical rate was much 
lower than the normal. 

In several of the investigations on loudness recruitment the phenome- 
non of hyper- or over-recruitment is reported. For an equal loudness match, 
less stimulus intensity is required for the recruiting ear than for the nor- 
mal, contralateral ear. Such over-recruitment is never more than a few 
decibels. Harris (42) says that there is no good explanation of this phenome- 
non of over-recruitment. However, Hirsh (49, p. 219) points out that if the 
“normal” ear has a slight transmission loss, say 5 db, and if at high intensi- 
ties equal mechanical excitation in the two cochleas is required for equal 
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loudness, then the ‘‘normal”’ ear with a slight transmission-deafness will re- 
quire a greater intensity of the stimulus, by 5 db in this example, than that 
required for the deafened, completely recruiting ear. 

The relation between the intelligibility of speech and the intensity at 
which it is received (articulation-gain function) provides important diag- 
nostic information. In pure transmission-deafness, the articulation-gain 
function is simply translated with unaltered form with respect to the axis 
that represents the intensity of received speech. In sensori-neural deafness, 
however, the function is not only translated with respect to intensity, but 
the maximum articulation score is considerably less than for the normal ear. 
The loss in discrimination for speech cannot be compensated for by increas- 
ing the intensity of the speech. 

Huizing & Reyntjes (54) provide further evidence that the articulation- 
gain function for the ear showing marked recruitment reaches a maximum 
and then declines as the intensity of speech is increased. The recruitment 
phenomenon results in a rapid growth of loudness as speech is increased in 
intensity. However, the severe distortion correlated with recruitment appar- 
ently causes a decline in the intelligibility of speech. 

Azzi (3), in a lengthy discussion of speech audiometry, also reports this 
third type of articulation-gain function. 

Cawthorne & Harvey (15) found that the articulation gain-function for 
37 out of 40 cases with Méniére’s disease reached a maximum and imme- 
diately declined. The decrease in the percentage of words recognized as the 
speech intensity was increased was so great that they describe the function 
as ‘“‘hump-backed.” In these cases, the pure tone audiogram often showed 
only a moderate loss, and this loss was usually uniform with frequency. 
Cases with severe traumatic deafness also showed the ‘“‘hump-backed”’ ar- 
ticulation curve. Day (20) states that the pure tone audiogram does not 
nearly describe the great difficulty in understanding of speech that certain 
patients show with Méniére’s disease. Day points out that roaring tinnitus 
and distortion attributable (probably) to severe recruitment conspire to re- 
duce markedly the ability to understand conversational speech. Barber & 
Ireland (4) concur with Day that the final assessment of retained hearing 
should not be based on the patient’s ability merely to detect the presence of 
a stimulus, and Knight & Littler (60) believe that it is generally more satis- 
factory to evaluate the speech audiogram itself than to predict it from the 
pure tone audiogram. Some of the procedures and advantages of speech 
audiometry are pointed out by Walsh (105). He also reported that recruit- 
ment or over-recruitment was present in almost every case of Méniére’s dis- 
ease. The discrimination loss was universally high. In many cases these high 
discrimination losses were in contrast to the relatively moderate hearing 
loss. Furthermore, an end-organ lesion seems to be indicated by a high dis- 
crimination loss with a relatively small hearing loss for speech. 

Bocca & Pellegrini (12) obtained articulation-gain functions with normals 
in which various types of deafness were, simulated by using frequency se- 
lective circuits both with and without expansion. They matched the fre- 
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quency selective circuits to the audiograms and in many cases the resulting 
articulation curves for the normal ears obtained with frequency distortion 
and expansion resembled those of patients with frequency distortion and 
recruitment (expansion?) arising from disorders of the ear. They believe 
that the marked frequency distortion present in patients showing high-tone 
deafness is compensated for by recruitment, especially in the reception of 
meaningful words. Studies of this type are far too rare and this writer be- 
lieves that a more complete understanding of deafness would be achieved by 
simulating deafness in its various forms by means of communication circuits 
and masking noises. 

The methods developed by Fletcher for calculating articulation scores 
for communication systems both with and without noise, but for normal 
listeners, are applied (28) after simplification to the perception of speech by 
persons with abnormal hearing. There is little question that the hearing loss 
for speech (threshold shift in decibels) can be predicted from the pure tone 
audiogram with considerable accuracy from well-known formulae. To pre- 
dict, as Fletcher wishes to do, the entire course of the articulation curve using 
difficult test items for cases with either nerve or mixed deafness is quite 
another matter. The articulation index is obtained as a product of four fac- 
tors: the frequency distortion (relative response at different frequencies), the 
sensation level of the speech sounds, the ear (amplitude) distortion which de- 
pends on how near the sounds are to the threshold of feeling, and the pro- 
ficiency factor which depends on how well the listener knows the speech 
sounds. Both bone and air conduction audiograms for pure tones are needed 
so that the degree of transmission-deafness and nerve-deafness may be eval- 
uated separately. The effect of transmission-deafness is taken as identical 
with a loss in gain, and the effect of nerve-deafness is considered to be iden- 
tical with the effect of masking in the normal ear. The computations are 
made by applying appropriate weighting functions to each of six frequencies. 
The details are too involved to present here. The agreement between the 
calculated and obtained scores is fairly good, but a final evaluation of the 
generality of Fletcher’s methods must wait until data are obtained from 
patients known to have either scalar or VIIIth nerve lesions. From the re- 
ports already reviewed, it would seem that an additional factor that handles 
marked recruitment will be required in certain instances of scalar deafness. 

Very little work has been reported on animal research on labyrinthine 
hydrops. One interesting study is that of Fernandez, Ludwig & McAuliffe 
(27) who were interested in standardizing the procedure of electrocoagula- 
tion of the vestibule in the treatment of hydrops of the labyrinth. They re- 
corded the cochlear microphonic and action potentials both before and after 
various forms of surgical insult to the membranous labyrinth of guinea pigs, 
and then examined the structures histologically. The increases in the elec- 
trical ‘‘thresholds,’’ both cochlear microphonic and action potential, corre- 
lated well with the degree of injury to the hair cells. 

This topic of recruitment and types of deafness has been discussed at 
length because of its great importance, not only to the clinic, but to the sen- 
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sory psychologist. If it turns out that two distinct types of sensori-neural 
deafness must be recognized, then new evidence may be forthcoming to help 
us solve such problems as the mechanism of stimulation of the auditory 
nerve cells, the physiological basis of loudness, diplacusis binauralis, tinnitus, 
the discrimination of speech sounds, and other phenomena. 

Theory.—The two types of sensori-neural deafness might arise from the 
following mode of action of the hair cells and the VIIIth nerve. With hair 
cell damage, the associated nerve elements are not activated at low stimulus 
intensities and the absolute threshold is increased. At higher levels of the 
stimulus, the activity of the hair cells responsible for firing the nerve spreads 
in locus and brings into action those nerve elements which normally would 
be fired by their own sensory cells. At still higher levels, the VIIIth nerve is 
discharging as if all hair cells were intact. Loudness is then fully recruited. In 
VIIIth nerve damage, certain nerve elements are permanently lost, and even 
at high intensities they remain nonfunctional. If at each stimulus intensity 
the number of inactive elements is a constant fraction of the active fibers 
[Dix et al. (23)], little or no loudness recruitment would occur. 

The present writer suggests that discrimination loss for speech is severe 
for the ear showing end-organ damage because (a) dL/dI is abnormally 
large, and (b) with speech as a stimulus dJ/dt is also large, therefore dL/dt 
is very great. For the ear showing VIIIth nerve damage, dL/dt for speech is 
probably reduced over that for the normal ear, and the discrimination loss 
resembles the effect of a passive filter. 


AUDIOMETRIC CHANGES FOLLOWING FENESTRATION 


A few of the studies regarding the effect of the fenestration operation 
upon hearing loss and methods for predicting this loss will be discussed. Other 
reports not reviewed here are those of Lempert (63), Skoog (95), McKenzie 
(75), and Asherson (2). 

Shambaugh (92) presents the results of the measurement of hearing loss 
both before and after the fenestration operation in 164 consecutive cases. Of 
importance is the simple formula developed for predicting the probable out- 
come of the operation. This formula is based on two assumptions: (a) it is 
assumed on good evidence (14) that the bone conduction audiogram suitably 
corrected for the Carhart or otosclerotic notch is a reliable and valid measure 
of cochlear reserve, and (b) with factual support (19) it is also assumed that 
the successfully fenestrated ear has a definite residue of unrestored conduc- 
tive loss attributable to the altered mechanics of sound conduction. The for- 
mula predicts surprisingly well the actual hearing by air conduction six 
months following fenestration. 

McConnell & Carhart (73) confirmed the observation that the bone con- 
duction audiogram obtained from patients with stapes fixation gives an under 
estimate of the true cochlear reserve. Their results also agree with those of 
Davis & Walsh (19) in showing that the minimum hearing loss to be ex- 
pected after this operation is 20 db, and that the maximum preoperative 
conductive loss produced by stapes fixation is 60 db. 
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Thurlow et al. (103) give us a further description of the long-term study 
of the results of both speech and pure tone tests of hearing administered 
routinely both before and after the fenestration operation. This statistical 
study based on about 100 additional patients confirms the conclusions of 
their previous studies. This series of papers might well serve as a model for 
this type of evaluation of clinical procedures in audiology. 

Cawthorne & Harvey (15) add more evidence that a close correlation 
exists between the hearing loss for pure tones and for speech in pure trans- 
mission-deafness. In mixed deafness, the correlation is somewhat reduced. 

Nilsson (80) measured with a free-field audiometer the change in hearing 
for pure tones at various stages during fenestration operations performed 
under local anesthesia. In most cases when the labyrinth is opened, there is 
an immediate improvement in hearing for both air and bone conduction. 

Link & Zwislocki (68) go into considerable detail regarding the difficulties 
in the measurement of the cochlear reserve by means of bone conduction 
audiometry. Such audiometry is necessary since Fowler’s recruitment test 
cannot be relied upon alone to distinguish transmission-deafness from retro- 
cochlear lesions. As a result of the complex way in which the ear is stimulated 
by a bone conduction receiver, nearly the least favorable point of application 
of the stimulus for diagnosis is the mastoid process. They compared bone 
conduction thresholds measured at the mastoid and the temple in normal 
ears and in patients with middle ear infections. They masked the contra- 
lateral ear by means of a narrow band of noise, centered about the frequency 
of the test stimulus, and the intensity of this noise was automatically ad- 
justed to maintain the appropriate degree of masking as the intensity of the 
test stimulus was varied. The principal measurements were made with the 
ear canal not occluded. Their results confirm Békésy’s earlier findings that 
the threshold measured at the temple is about 10 db more than if measured 
at the mastoid process, and that this difference is essentially independent of 
frequency. They conclude that tests made at the temple circumvent the 
effects of the middle ear upon the bone conduction thresholds better than 
when measured at the mastoid process, and they recommend application of 
the receiver to the temple in spite of the loss in efficiency at this locus. 

Benedict’s study (7) is concerned with the correlation between anatom- 
ical structure as altered by fenestration and hearing as measured by instru- 
mental avoidance conditioning. Clinical otosclerosis was simulated in the 
monkey’s ear by removing the incus (mastoidectomy), and thus interrupting 
the chain of ossicles. Removal of the incus resulted in a hearing loss of about 
45 db that was fairly uniform from 150 to 4000 c.p.s. Then, following Lem- 
pert’s surgical technique, K. Lowry fenestrated the ear in various ways, the 
animals’ hearing was tested, and histological examination was undertaken. 
Since the oval window was not occluded, the fenestration operation added a 
third functional window to the inner ear. No obvious correlation was found 
between the thresholds and the condition of the fistula. However, this find- 
ing should not be too disturbing, since there were (apparently) always at 
least two mobile windows in the inner ear. 
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Van Eyck (104) discovered a pigeon with a congenital ankylosis of the 
oval window that was similar to the stapes ankylosis in human otosclerosis. 
After electrophysiological study of the ear of this pigeon both before and 
after fenestration, he concluded that sound is transmitted to the cochlea by 
way of the round window, and that the function of the new window is to 
permit liquid motion. 

Huizinga (55) discussed the reaction of Tullio and the fenestration opera- 
tion, and made it clear why the patient is not troubled with this reaction. 


TINNITUS AND PARACUSIS 


Subjective tinnitus aurium is a fascinating phenomenon in the normal 
ear, but one which can be annoying or even intolerable to the patient suffer- 
ing from certain otologic diseases. Such tinnitus is usually localized at one 
or both ears, and it can be described with some specificity. Although tinnitus 
is extremely common in many otologic disorders little research has been di- 
rected toward more precise description or measurement of it. Tinnitus may 
change the sound of an acoustic stimulus and thus it resembles an illusion; 
but it can be heard in the quiet. Sometimes it is hallucinatory and is given 
objective reference; but under observation the same tinnitus may be local- 
ized in the ear and the hallucination disappears. Tinnitus may seem loud; 
but its equivalent acoustic intensity is never very great. Because of the dif- 
ficulties in obtaining accurate descriptions of the auditory quality and inten- 
sity of tinnitus, Goodhill (39) has made a recording of a variety of sounds so 
that the individual can make a more accurate description of his tinnitus. 

After a study of tinnitus aurium in both normal and deaf individuals, 
Heller & Bergman (47) conclude that the qualitative characteristics of the 
tinnitus experienced by the normals were similar to those reported by the 
patients. The relative frequencies of occurrence of different auditory quali- 
ties were nearly the same in the two groups. 

Sullivan & Hodges (98) studied the phenomenon of Paracusis Willisii 
by determining the quiet and masked absolute thresholds for pure tones. 
With the noise level used for masking, the masked thresholds above 1500 
c.p.s. obtained from patients with bilateral conductive deafness are on the 
average nearly the same as the masked thresholds for the normal ear. 

In the quiet, when speech is intended for the normal listener, the para- 
cusic is at a severe disadvantage. In noisy surroundings, however, when the 
speech intensity has been adequately increased for the normal listener, the 
paracusic may also hear. In this connection, see Kryter (62). 


OTHER DEVELOPMENTS 


This review does not include a considerable number of very fine articles 
on hearing and related topics which have appeared during the past year. 
Practically the entire field concerned with the physical analysis and the per- 
ception of speech has not been treated. At least we must mention the diversi- 
fied group of papers presented at the Conference on Speech Analysis, and pub- 
lished as a unit (16). There have been papers and monographs on audiom- 
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such as Palva’s Finnish Speech Audiometry (81), and the description 


of new recorded speech tests for the measurement of hearing and discrimina- 


tion 


loss (50). An adequate survey could not be accorded in this review to 


the problems that arise when man is exposed to the intense noises of indus- 
trial and military situations (88). Because these developments are too import- 
ant and involved to treat briefly, their summary and evaluation must remain 
for a later review. 
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THE CHEMICAL SENSES! 


By BERNICE M. WENZEL 
Department of Psychology, Barnard College, Columbia University, New York, New York 


In reviewing the chemical senses for Volume 1 of the Annual Review of 
Psychology, 1950, Geldard remarked that it was still too early to determine 
whether the interest then being shown would turn out to be more than a 
flurry. Since the number of articles reviewed here annually has tended to 
increase, it seems fair to say, after five years, that interest in the field is 
definitely strengthening, as evidenced not only by the number of publications 
but also by the appearance of at least some continuity. An encouraging 
symptom of vigor in an experimental area can be detected when one man’s 
conclusion becomes the source of the next man’s hypothesis. Such a symp- 
tom is now just noticeable, especially in studies of taste. Still another type 
of evidence is afforded by the ambitious and very well-attended two-day 
conference, sponsored jointly by science and industry, to discuss basic re- 
search problems in odor (4). The next five years can be expected to provide 
some real progress in experimental findings. 

Before we take up the research articles, a few reviews will be mentioned. 
The chemical senses have been plagued with too many reviews of too few 
facts but some which have appeared this year are contributions, nonetheless. 
Beidler (1) has written a summary of current information concerning taste 
receptors for a general semiscientific audience. It is both a comprehensive 
and comprehensible account and is highly recommended. Another review, 
this one on smell, is by Geldard (3). It, too, is an excellent summary of the 
principal facts and speculations in the field. This reviewer wishes, as the only 
change, that somewhat less attention had been given to systems of classifica- 
tion and a little more paid to the possibility of using olfactometers with more 
suitable controls than those of Zwaardemaker and of Elsberg. As an example 
of the type of review that is fairly common in scientific literature of industrial 
origin, that of Crocker (2) may be cited. The following quotation generally 
typifies the underlying theme of such articles. ‘Sensation is subjective. 
Nobody can touch or measure anything that is subjective. Sensation is 
unique for any particular person. It is dealt with by the psychologist” (2, 
p. 169A). 

GUSTATION 


Neurophysiology.—This is one of the most active areas, having several 
studies involving cortical or glandular extirpation. Bonvallet et al. (9) give 
the results of recording potentials from central gray matter evoked by 
stimulation of the afferent nerves of the cat’s tongue. Potentials with varia- 
ble latencies and amplitudes were recorded from the entire head of the 


1 The period covered by this review is approximately from June, 1952 to June 
1953. 
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caudate nucleus, the lateral and central nuclei of the amygdaloid complex, 
the claustrum, and the septal region. Stable responses appeared in the puta- 
men. Such data can be of assistance in charting the exact course of the entire 
gustatory pathway. It is not clear from the abstract, however, whether taste 
mediating nerves alone were stimulated. 

Benjamin & Pfaffman (8) investigated the role of a specific cortical area, 
as located by evoked potentials from the chorda tympani and the glosso- 
pharyngeal nerves, in the rat’s taste discrimination. The region excised was 
the inferior portion of Area 1 on the postorbital surface just above the rhinal 
fissure. The behavioral measure was the threshold for quinine aversion. It is 
hard to appreciate the full significance of the experiment from an abstract 
and a brief paper read at a meeeting, but the suggestion is that the cortical 
region concerned is of some importance for taste discrimination and that 
subcortical damage may be critical. These results seem to be in agreement 
with the taste localization by Patton & Amassian (24) based on a study of 
evoked potentials. 

It has been reported previously that temporal lobectomy in monkeys 
may produce dietary changes. Bagshaw & Pribram (7) have tried to shed 
some light on this phenomenon by studying changes in the acceptance thresh- 
old for quinine hydrochloride as a function of the region of temporal cortex 
removed. Acceptance was markedly increased following removal of the insula, 
operculum, and anterior supratemporal plane from three monkeys, and there 
was a slight increase with excision of only the insula and operculum in one 
monkey. Removal of the insula alone in one monkey or in combination with 
the supratemporal plane in another monkey had no effect. The authors sug- 
gest an analogy with vision, consisting of a primary system and a crucial 
center spot. This experiment could profitably be followed by a series designed 
to measure the effect of various combinations of damage on discriminations 
among taste stimuli, once the present findings are confirmed on more 
animals. 

In the same area of food habits, Anand & Brobeck (6) have shown that 
bilateral lesions in the lateral or medial group of amygdaloid nuclei of rats 
had no effect on amount of food eaten as compared with controls of the same 
age, although previous studies on dogs and monkeys had shown this effect. 

A worthwhile attempt was made by Jones & Jones (16) to repeat the 
Allen & Weinberg study (5) on fusion frequency of taste sensations from 
direct electrical stimulation of receptors. The earlier data had been offered 
as evidence for four distinct taste systems on the basis of different fusion 
frequencies. Jones & Jones were unable to obtain reports of anything but 
continuous sensations from six subjects, some being experienced and some 
inexperienced observers. Their reasonable suggestion is that the previous 
results should not be used as evidence until the cause of the discrepancies is 
made clear. 

The role of the chorda tympani and glossopharyngeal nerves in the rat’s 
sensitivity for sodium chloride and for quinine sulfate has been evaluated 
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by Pfaffman (26). Four experimental animals, with section of the chorda 
tympani alone and tested before regeneration was possible, were compared 
with four control animals. He found essentially no difference in the pre- 
ference-aversion function for sodium chloride, and only a slight lessening of 
aversion to quinine (one animal was added to each group for the quinine 
tests). Cutting both nerves in six animals resulted in significant reduction 
of both preference and aversion for salt and aversion for quinine. In other 
words, the ability to discriminate among the different concentrations was 
greatly reduced but not entirely gone. It appears, then, that taste stimula- 
tion, as mediated by these nerves, functions to a large extent in the prefer- 
ference-aversion responses to sodium chloride. After histological study, 
Pfaffman attributed what little sensitivity remained to taste buds of the 
pharynx and the roof of the mouth, those of the latter being on the papillae 
of the naso-incisor ducts. The discovery of taste buds in these ducts is re- 
ported here for the first time. 

Carr (11) has tested Richter’s hypothesis that receptor sensitivity is 
lowered with need by comparing the preference threshold, Richter’s measure 
of sensitivity, with the sensory threshold. The sensory threshold for sodium 
chloride was measured in 11 rats, before adrenalectomy, by a conditioning 
technique. The mean daily intake of 3 per cent sodium chloride and of tap 
water was also measured for 10 days. The postoperative measures for seven 
rats consisted of 10 days of comparative intake of salt solution and water, 
then measurement of the sensory threshold, followed by 10 more days of 
comparative intake. A significant increase occurred in amount of salt solu- 
tion drunk in both postoperative periods, but the postoperative sensory 
threshold did not change. The sensory thresholds found were comparable to 
those obtained by Pfaffman & Bare (25) in direct electrical recording of re- 
ceptor sensitivity. Carr’s results are certainly of great interest but probably 
should not be considered completely conclusive until alternative ways of 
measuring the sensory threshold have been tried (see Cubbage & MacPhee 
below). His procedure involved giving a shock if water was drunk. Use of 
shock in this way for threshold determinations is dubious in that it intro- 
duces an unassessed variable. 

Individual differences ——This section is something of a catchall for experi- 
ments dealing with individual differences per se as well as those concerned 
with factors which might increase variability among subjects. 

The following experiment was designed to test the same hypothesis as 
that of the last experiment in the preceding section. Starting from Richter’s 
premise, Meyer (21) tested the interesting hypothesis that the threshold for 
glucose in human subjects would change as a function of food deprivation. 
Nine subjects went without food for 34 hr. while the threshold for glucose 
was measured along with control thresholds for two less vital substances 
(sodium chloride and quinine bisulfate), at nine different times. At the end 
of the deprivation period they ate freely, and the thresholds were again 
measured. The curve of threshold against deprivation time for each substance 








114 WENZEL 


had essentially a zero slope. Analysis of variance showed the effect of depri- 
vation to be insignificant but intersubject thresholds were significantly 
different. These data are the third set which fail to support Richter’s hypoth- 
esis. It could be argued in this case that the need was not severe enough 
or prolonged sufficiently to affect the receptors. However, even at the end 
of the 34 hr. period, there was no sign of a downward trend. It seems to be 
time to start looking elsewhere for the explanation of increased intake of 
appropriate substances with need. 

Meyer’s experiment is also relevant to the reports of Goetzl’s group 
appearing in recent years on the effects of diurnal variations associated with 
ingestion of food on gustatory and olfactory sensitivity. This year Irvin & 
Goetzl (15) reported experiments that show a significant decrease in 
threshold for sodium chloride as time since last meal increases, and a signif- 
icant rise immediately after eating. The time involved is merely that be- 
tween breakfast and lunch on one day. It is puzzling, to say the least, that 
the results of Meyer and of Goetzl et al. are so sharply at variance. Meyer 
himself does not even refer to the work of Goetzl et al. so there are no sug- 
gestions from him, and the published accounts of procedure are not detailed 
enough to reveal grounds for any critical differences. Since the problem under 
investigation is an important one, not only from the point of view of receptor 
action but also methodologically for experiments where such a variable is 
not wanted, it deserves further consideration. Previous reviewers have 
commented critically on Goetzl’s failure to mention, if not include, some 
needed controls. One further recommendation might be to set up the pro- 
cedure in such a way that the experimenter has no knowledge of the degree 
of food deprivation represented by each subject. 

Four recent papers deal directly with some aspect of individual differences. 
Thieme (28) has contributed one on tasting of phenylthiocarbamide. It 
provides merely the frequencies of nontasters in a total of 3,229 Puerto 
Ricans, divided into 15 geographic regions and 10 ethnic groups. The total 
frequency was 12 per cent, with no significant differences between regions or 
groups. A study of individual reactions to the primary tastes was made by 
Kahn (17), but unfortunately the data suffer from an end effect. His con- 
centrations of most of the substances were not weak enough to measure 
thresholds accurately. The author concluded that variability was present, 
although no formal statistical estimate was made. The problem is a worthy 
one, however, for information on individual reactions is greatly needed. 
Pfaffman (27) has approached this same problem by way of species differ- 
ences. Using the rat, rabbit, and cat, he took recordings from the chorda 
tympani after stimulation of the tongue by substances for the primary 
tastes. The response to hydrochloric acid was similar in all animals but re- 
sponses to the other stimuli differed to such an extent that Pfaffman warns 
that ‘‘...any general theory of taste must take species differences into 
account.”’ In line with this idea, Murray et al. (23) described a series of four 
ingenious experiments which show a definite difference between rats and 





YIIM 


THE CHEMICAL SENSES 115 


human beings in their reactions to sodium sucaryl. Solutions which human 
subjects judged equally sweet and preferable to saccharin were avoided by 
rats when paired with water, although the rats preferred saccharin to water. 
The authors suggest that the differences could be a matter of differential 
sensitivity to the ‘‘sweet”’ and “‘bitter’’ components of sucaryl. This phenom- 
enon may be related to an earlier speculation that saccharin tastes bitter 
to dogs (see 30, p. 120). 

Changes in identification thresholds for the primary tastes after stimula- 
tion by monosodium glutamate were studied by Mosel & Kantrowitz (22). 
They tested four subjects just before, and 5 min. after, a 7-sec. exposure to 
a strong solution of monosodium glutamate. The thresholds for salt and sweet 
remained unchanged but those for sour and especially for bitter decreased. 
This whole matter is somewhat confused, both in terms of previous research 
results and of everyday experience. Therefore, the authors could have made 
a greater contribution if they had included a control group and more sub- 
jects so that more sensitive statistical techniques could have been applied 
and confidence increased. 

Studies on insects—In the hands of competent workers, certain insects 
have proved to be very convenient subjects for investigating chemical 
sensitivity. The accessibility of their receptors is an especially valuable char- 
acteristic. There are two experiments to report here, both excellent examples 
of good design. 

The relative acceptance by the honeybee of sugars present in nectar was 
studied by Wykes (31) under both field and laboratory conditions. In the 
field, sucrose, glucose, and fructose were used, and the measure of preference 
was the number of bees on each dish at 10 min. intervals for 3 or 4 hr. Sucrose 
and glucose were preferred significantly over fructose but a mixture of the 
three sugars in equal parts was always preferred over any one, a mixture of 
any two, or an unequal mixture of the three. Maltose, not present in nectar, 
was added in the laboratory tests where the measure was uptake during 1 or 
2 hr. Here the order from most to least preferred was sucrose, glucose, 
maltose, and fructose. Again, the sucrose-glucose-fructose mixture in equal 
parts stood out; it was consumed to a significantly greater degree than was 
predicted from the mean uptake of the constituents. The implication is clear 
that the sugar composition of nectar may be an important factor in influenc- 
ing visits of bees to flowers. 

The basic problem of adaptation has been attacked systematically by 
Dethier (13), using the tarsal receptors of the blowfly and solutions of su- 
crose. The insect was held ina paraffin block so that only his front parts were 
free. The indication of adaptation was retraction of, or failure to extend, 
the proboscis when the legs were put in sucrose. It was found that the time 
for complete adaptation increased as the logarithm of the adapting sucrose 
concentration. Continued stimulation from supraliminal concentrations had 
no further effect on the threshold; time may be a factor in changing the 
threshold when subliminal concentrations are used, however. With continu- 
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ous stimulation at one intensity, intermittent bursts of activity occurred. In 
looking for the site of adaptation, two techniques were utilized with opposing 
results. If one leg was tied while the other was adapted, no response was 
made when the resting leg was then stimulated; this indicated central control. 
When the two legs were stimulated differentially by sucrose and by water in 
a divided dish and then the leg that at first had been in water, and hence 
unadapted, was put into sucrose a response occurred 66.6 per cent of the 
time. Dethier speculates that there may be two centers involved. Simul- 
taneous stimulation by mixtures of acceptable and unacceptable substances 
resulted in adaptation to both at the same rate. Further studies of this type 
would provide useful data to aid in understanding the action of the receptors. 

Taste and flavor testing —An extensive literature exists on these problems, 
but only a few of the more basic references, concerned chiefly with methodol- 
ogy, are included here. A good summary of techniques in panel testing is 
presented by Konigsbacher e¢ al. (18). They include the use of rating scales 
and flavor profiles, the use of binary test systems, and some general discus- 
sion of statistical treatments and applications. Binary test systems may be 
exemplified by the triangle test wherein each judge is asked to select from 
three samples the one which is different. Tests of this sort could be conveni- 
ent in the psychophysical laboratory for sorting subjects quickly into broad 
categories of sensitivity. Further attention to the importance of the experi- 
mental design and the availability of powerful statistical tools is given in 
articles by Mason & Koch (20) and by Bradley (10). The latter is particu- 
larly recommended for the discussion of statistical methods and its extensive 
classified bibliography. A methodological experiment testing the effect of 
certain features of the design on judgment of flavor characteristics was re- 
ported by Hopkins (14). 

An interval scale for sweetness was constructed by MacLeod (19) from 
half-judgments for sucrose and glucose separately and from judgments of 
equivalent sweetness between the two sugars. MacLeod’s attempt to evalu- 
ate validity by cross predictions within the obtained data can be implemented 
by testing predictions to other situations. Such a scale, if its validity can be 
established and if it can be used reliably, would be a great aid both in applied 
and theoretical problems. 

Learning.—A study of Pavlovian conditioning by Travina (29) showed 
that ‘‘alimentary”’ conditioned responses could be established with equal 
ease when the unconditioned stimulus was provided by the normal act of 
eating or by stimulating surgically exposed portions of the tongue. This 
result would be expected on the basis of various experiments using inade- 
quate stimulation for the unconditioned stimulus. 

Cubbage & MacPhee (12) have given an interesting demonstration of 
maze learning in rats as a function of gustatory cues. They used a linear 
maze with four choice points. There was a nozzle holding eight drops of 
liquid in front of each door, and there were two doors at each choice point. 
Sixteen rats were divided into two equated groups. Group I learned the maze 
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first with cues present, then ran without cues; Group II followed the reverse 
procedure. When cues were present, the nozzle in front of the correct door 
contained salt solution while the other had water. In the control condition, 
both nozzles held water. Running under 24 hr. water deprivation and with 
one trial per day, Group I met the criterion (group mean of 0.75 errors on 
four successive days) with cues in 32 trials; when cues were removed, the 
error curve rose. Group II, running first without cues, performed at chance 
for 20 trials after which cues were added and the criterion was met in 20 
trials. The results were highly significant, with p less than .001 for the effect 
of cues. This technique could be readily adapted to threshold studies and 
studies of sensory equivalence using rats and other animals. 


OLFACTION 


As in previous years, articles on olfaction dealing with the results of ac- 
tual research are less plentiful than on gustation. The area cannot be said to 
be suffering from neglect, however, for it is being discussed a great deal and 
an increasing amount of work can be hopefully anticipated. 

Neurophysiology—Bonvallet et al. (9, 42a), in the study discussed earlier, 
evoked potentials in several central areas following stimulation by means of 
bipolar electrodes in the two olfactory bulbs. The largest, fastest, and most 
stable responses to rapid stimulation (50/sec.) were recorded in the pre- 
piriform and piriform cortex, the anterior commissure and its branches, the 
ventral part of the head of the caudate nucleus, the olfactory tubercle, and 
the anterior limbs of the internal capsule. Their conclusion is that these 
regions constitute the direct olfactory system. Potentials with longer latency 
were detected in the claustrum, putamen, globus pallidus, and the lateral, 
central, and basal amygdaloid nuclei (the medial was not explored). The 
slowest responses, with latencies longer than 10 msec., occurred in the septum 
and the dorsal part of the head of the caudate nucleus. It is considered that 
the septum and hippocampus receive no direct fibers. All of the rapid re- 
sponses were completely homolateral except for a few rare bilateral ones in 
the anterior commissure. These results from stimulation confirm some earlier 
pathways mapped by means of degeneration. They point once more to the 
intricacy of the olfactory central connections as well as to the neuroanatomi- 
cal interactions with other sensory systems. Some potentials were recorded 
even in the cerebellum. 

Adrian (32 to 36), in the last year and a half, has continued to summarize 
in very brief articles his results obtained by direct electrical recordings from 
the olfactory bulb of the rabbit. He has been aided in this work by a special 
high-pass filter, designed in his laboratory, which rejects frequencies below 
400 c.p.s., making the continuous background potentials much less trouble- 
some in interpreting the spikes associated with stimuli. He no longer holds 
his earlier notion that the mechanical effect of the passage of air through the 
nostrils in normal breathing produced potentials in the olfactory nerves. 
When there was an odorant in the air which was drawn through the nose by 
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means of a syringe, the rate of flow had to be 4 cc. before it was effective. At 
rates from 6 to 30 cc., threshold was inversely related to concentration while 
latency was constant at all rates. Thresholds and latencies varied with the 
substance and the region tested. By recording in different parts of the bulb, 
he found that an abrupt discharge occurred near the anterior pole to stimula- 
tion by esters. Hydrocarbons resulted in a more gradual discharge deep in 
the posterior half. He speaks of evidence for the same relationship in the cat, 
plus a third posterior dorsal region where a discharge could be recorded from 
stimulation by amines. It is certainly possible, as he hints, that spatial dis- 
tribution of this sort may underlie olfactory discrimination. In addition, he 
shows that various groups of receptors are differentially sensitive. One elec- 
trode can be placed to record large spikes from a single neuron as well as 
smaller spikes from neighboring multiple units. Such a preparation in the 
anterior and posterior areas showed little difference in sensitivity, but it 
was not so in the middle. There, a unit which responded to a water soluble 
substance rarely responded to a lipoid soluble substance. In addition, the 
former response was characterized by abrupt onset and high frequency while 
the latter was often of low frequency. At the time of writing, he was unable 
to say how many receptor types there may be. It is hoped that he will soon 
present a more complete account of these researches, comparable to his re- 
view in 1950 (36a), for their importance seems very great indeed. 

Two further papers attest the growth of interest in the recording of 
electrical activity. Additional data have been obtained from the rabbit by 
Walsh (53) who confirms the general finding of a high level of resting ac- 
tivity, but with intermittent discharges when odorous room air is drawn 
through the nose. Upon application of specific odorous materials, a massive 
discharge was recorded. The principal contribution of Walsh’s report is 
concerned with adaptation. With prolonged stimulation, there was first a 
spurt of activity, but adaptation occurred from three to five respiratory 
cycles later in the pause between exhalation and inhalation. He notes that 
there were individual differences in responsiveness to different substances. 
Walsh was able to record spikes from single units in the opossum by means 
of micropipette electrodes under very light anesthesia. The picture was es- 
sentially the same as that from larger regions, viz., spontaneous resting dis- 
charge and increased frequency in the presence of odorous material. No in- 
formation was included in the printed abstract which would allow com- 
parison with any of Adrian’s specific findings. 

The work of Sem-Jacobsen et al. (50) is a fitting climax for this discussion 
of centrally recorded activity inasmuch as this group has described responses 
measured in a human subject. Opportunity was taken of examination pro- 
cedures preliminary to leukotomy on a 53-year-old psychotic woman to 
implant depth electrodes in the vicinity of the olfactory nerves and bulb. 
The left frontal electrode actually rested on the cribriform plate for some 
distance and gave less background discharge, as would be expected, than 
leads in the bulb. Inhalation of an odorous substance served to initiate, in 
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less than 1 sec., a -hythmic series of 28 to 32 c.p.s. and intensity of 60 uv. 
They feel that tk> bulb is the origin of the discharges recorded since axon 
spikes are absent and the discharge is a rhythmic one, actually the first 
example of a rhythmic response occurring to a constant stimulus rather 
than to an intermittent one. No direct data are presented for pure air 
as stimulus. A number of odorous substances of unspecified strength and 
purity were used as stimuli, and the intensity of response was graded. 
Little response was noted to room air, hyocyamus, acetone, and alcohol; 
moderate response occurred to oil of wintergreen, bitter almonds, onions, 
turpentine, camphor, raspberry, and cinnamon; the ‘“‘best’’ responses ap- 
peared to lilac perfume, oil of cloves, oil of lemon, benzine, coffee, oil of 
peppermint, oil of lavender, and valerian. Some of these rankings are 
quite surprising but should not be considered too seriously because of the 
poor co-operation received from the subject. Lack of co-operation was a real 
hindrance, too, in any attempts to relate the neural responses to discrimina- 
tive ability. Responses were completely abolished at all stages of thiopental 
(barbiturate) anesthesia, which is not true of animal preparations. Some 
extracranial recording was done successfully through the nasal fossa, but it 
was not as clear as that from the depth electrodes. If it is at all feasible to 
record in this way, however, the technique should certainly be exploited, for 
even reduced data might be better than no data. Two hypotheses are ad- 
vanced regarding the nature of the olfactory system in the human. First, 
we should consider the possibility of a population of neurons with a charac- 
teristic frequency of discharge remaining inactive until a stimulus disturb- 
ance, even a steady, continuing stimulus, sets the oscillation going. Second, 
based on the finding of an almost linear relationship between intensity of 
response and intensity of stimulation, is the suggestion that amplitude, 
rather than frequency-modulation underlies the olfactory system in contrast 
with most afferent systems. The experiments reported above may have opened 
a most fundamental research area in which real progress could occur. 
Psychotic subjects are involved in two further papers included in this 
section. Alliez & Pujol (37) have studied the coincidence of olfactory hallu- 
cinations and paranoia, a relationship often noted by others. On the basis of 
an analysis of several cases, with disorders both prolonged and recent and 
involving both pleasant and unpleasant odors, they suggest that the temporal 
lobe should be investigated in this connection. The electroencephalograms 
on these patients showed irregularities in the temporal lobe. The authors 
compare the often complex hallucinations to Penfield’s reports (49) of ‘‘forced 
thinking” occurring upon stimulation of cortical epileptogenic foci. Finally, 
Wenzel (54) has reported on measurements of olfactory thresholds pre- and 
postoperatively on 22 experimental patients and 4 controls in the second 
Greystone project. The experimental subjects underwent a variety of brain 
operations for the alleviation of psychosis. The only ablation considered to 
have a possible influence on olfactory acuity was transorbital lobotomy. 
The tests employed the Elsberg apparatus, with coffee, phenyl ethyl alcohol, 
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and peppermint oil as stimuli. The subjects’ inability to follow instructions 
reliably made it impossible to use the thresholds in any but a relative way, 
but it was clear that none of the operations had any effect either to raise or 
to lower acuity. 

A pparatus.—The ever-puzzling question of how to stimulate the human 
olfactory receptors has received some attention in the form of a blow by Jones 
(43) aimed at the Elsberg olfactometer. Many blows have been directed at 
this instrument, but Jones’ paper is probably the first to criticize it on the 
basis of direct empirical evidence. What might be called ‘‘Elsberg thresholds” 
were obtained on 10 subjects for three different concentrations of n-octane 
and amyl acetate. On the assumption that the number of odorous molecules 
entering the nostril is a significant variable for sensation, it would be pre- 
dicted that greater pressures (i.e., larger volumes) would be needed for de- 
tection of the weaker concentrations. This expectation was not realized; the 
thresholds for the different concentrations could not be shown to differ sta- 
tistically. The conclusion is that blast-injection thresholds are not directly 
comparable to thresholds obtained by presenting a series of concentrations 
of the odorant. So much work needs to be done in this area that it is dis- 
couraging to find only the above article. Inadequate methods must certainly 
be eliminated but so also must adequate ones be provided. A next step would 
seem to be to develop techniques for controlling the natural sniff. 

In the spirit of the quotation from Crocker in the introduction to this 
review, Langley & Polin (46) avoid the whole question of the human ob- 
server by replacing the osmoscope with a chemical measure of deodorant 
capacity. They developed a system in which odorous air is circulated in con- 
tact with a deodorant and the resultant is drawn off for chemical analysis to 
determine how much of the odorous components has been removed or modi- 
fied. Such a test is appropriate only when suitable analytic procedures are 
possible. Even then, the chemical reduction would have to be made meaning- 
ful in terms of a human being’s impression of odor reduction. 

Individual differences—lIn a search for an easily obtained and valid 
indicator of systemic fatigue, a group of investigators (52) carried out two 
preliminary experiments using healthy young men as subjects to test the use 
of the olfactory threshold for this purpose. In one experiment, thresholds for 
phenyl ethyl alcohol were measured by a modified stream injection technique 
at four different times. Tests were made 23 hr. after breakfast and lunch on 
two successive days, with the afternoon test on the first day immediately 
preceded by controlled exercise (walking up and down stairs). In the other 
experiment, thresholds were obtained for dry coffee by means of blast injec- 
tion 1} hr. after breakfast and 4} to 5} hr. after dinner on the same day. 
The evening session was calculated to take advantage of the normal accumu- 
lation of fatigue throughout the day. In neither experiment was there any 
indication whatsoever that thresholds were varying with fatigue. The 
hypothesis was considered not worth pursuing. Given more sensitive measur- 
ing devices and perhaps greater amounts of fatigue, some relationship might 
be detectable. 
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Kirk & Stenhouse (44) have given figures on a sample of the white popu- 
lation of Australia for incidence of inability to smell potassium, or possibly 
hydrogen cyanide. (The title speaks of potassium cyanide but the two names 
are used about equally often in the article with no clarification. Hydrogen 
cyanide seems more likely to be odorous.) The difference between adult males 
and females, 18 per cent and 4 per cent, respectively, suggests sex linkage 
which is somewhat supported by the data for 148 children of 122 parents in- 
cluded in the above figures. In addition, the intensity of the sensation for 
smellers seems to be affected by other genetic factors. 

Odor and chemical constitution.—Stoll (51) reports briefly on variations in 
odor associated with an homologous series of bicyclic farnesyl synthetic 
compounds. As usual, intensity of odor is said to decrease with molecular 
weight and character of odor changes as well. He believes that both intensity 
and character of odor are determined by the form and size of the molecule 
while the functional groups are only partially influential. For example, in 
the macrocyclic series the epoxydes are less odorous whereas in this series 
they are more odorous. On the basis of his finding of adaptation in 10 to 20 
sec. for molecules in the middle of the series, he suggests that there are fewer 
receptive centers for relatively larger molecules. Otherwise, adaptation 
would be slow since these molecules would take longer to reach all the centers. 

The promising beginning of a systematic exploration of adaptation has 
been made by Cheesman & Mayne (40). They hope to work out a hierarchy 
of materials in terms of the degree of community between pairs of substances 
as indicated by the slope of the adaptation function relating them. Such a 
hierarchy could then presumably be related to the chemical and physical 
characteristics of the substances. Their general procedure is of interest in 
that the masking thresholds were group measures. In summary, all subjects 
were tested at once, each one taking a sniff of the adapting substance and 
then sniffing various concentrations of the test material and recording a posi- 
tive or negative response for presence or absence of odor. The 50 per cent 
threshold was calculated by probit analysis and log-log plots made of thresh- 
old as a function of adapting concentration. The substances used in this ex- 
periment were isopropanol, cyclopentanone, and dioxan. When both stimuli 
were the same the slope of the best-fitting line was —0.7; when they were 
different the slopes varied from —0.2 to —1.0. 

Studies on insects. A paper related to the preceding topic of chemical 
constitution, using blowflies as subjects, describes the function between re- 
jection thresholds and concentration of homologous aliphatic alcohols as 
measured by Dethier & Yost (41). The same series of alcohols had previously 
been used in tests on tarsal receptors. Rejection, measured by per cent of 
flies at a porthole that admitted odorous air as compared with a porthole for 
pure air, increased with log of concentration and varied directly with vapor 
pressure. As the chain length increased, rejection occurred at logarithmically 
decreasing concentrations with no further stimulation beyond decanol. 
When thresholds are expressed as thermodynamic activity, all of the alco- 
hols are equally stimulating although this relationship is somewhat obscured 
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at the extremes. It leads to the hypothesis that the critical process for olfac- 
tion in this case is one of equilibrium. Stimulation of tarsal receptors had not 
shown such a relationship. The reports from Dethier’s laboratory should be 
followed closely by those interested in olfaction for their implications go be- 
yond insect physiology. 

Two further studies will be reported briefly. In one, Barnhart & Chadwick 
(38) showed that flies apparently contribute something to spots which they 
visit (dishes of mush bait, in this study) which increases their attractiveness 
for other flies. What the substance might be or by what mechanism it acts, 
is not discussed. Glynne Jones (42), in testing repellents for honey bees, 
found that repellency of phenol increased, decreased, and then increased 
again as concentration was increased. He concluded that olfaction was ef- 
fective for the weaker stimuli but that the common chemical sense, which 
the psychologist would prefer to subsume under cutaneous sensitivity, sup- 
planted it when stronger stimuli were used. 

Industrial problems ——Minimal representation is included in this section 
for two industrial problems which are intimately concerned with odors, viz., 
ventilating and air conditioning, and use of chlorophyll as a deodorant. In 
the first, Kuehner (45) has summarized his experiments concerned with de- 
termining the strength of odor which is intolerable in an occupied space. 
Naturally, he first faced the problem of how to measure odor level and finally 
resorted to a dilution method, similar to the familiar Fair and Wells tech- 
nique. The entire procedure and the report’s generality have been criticized 
by various persons, among them a psychologist whose main objection is to 
the insistence on judges who are in extremely high agreement, thus eliminat- 
ing individual differences which the industry must take into account. He 
mentions other dubious points in the procedure as reported, such as omission 
of any statement as to precautions against adaptation, the use of only two 
observers, and the discarding of ‘‘divergent readings.”’ 

If this review is to typify its period, it can hardly ignore chlorophyll. An 
uncritical account of the range of its effectiveness as a deodorant has been 
prepared by Lesser (47) who takes at face value results obtained by some 
inadequate experimental procedures. A series of experiments with uniformly 
negative results has been reported by Brocklehurst (39). He made use of 
either chemical tests (see Langley & Polin above) or the human nasal instru- 
ment to evaluate odor reduction. With the former, he found no difference 
in amount of methyl mercaptan remaining in a circulating system after 
contact with one of his chlorophylls as compared with the amount left after 
contact with water. Of more interest to the psychologist are those tests where- 
in the sensation of odor was the criterion. In a variety of ways, several odor- 
ants were put in contact with one or another chlorophyll preparation for 
varying times. In each case, the test was mere presence or absence of odor 
and, with a couple of minor exceptions, odor was present in each test case. 
Brocklehurst has been criticized for making no effort to measure degree of 
odor and for using many odorants (such as perfumes) which were never 
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claimed to be susceptible to chlorophyll. A very few clinical trials were also 
run, again with completely negative results. The author is roundly criticized 
in several letters to the editor for chemical and other errors, including some 
already stated here, and especially for failing to specify his chlorophylls 
exactly after criticizing previous workers for the same thing. One final ar- 
ticle on chlorophyll is included since it gives a clue to the chemical action 
when deodorization actually occurs, as it does with many compounds con- 
taining sulfur. Mitchell (48) reports that benzyl mercaptan in an aqueous 
solution of sodium chlorophyllin or sodium copper chlorophyllin becomes 
odorless in 24 hr., forming a suspension of odorless dibenzyl disulfide. The 
same thing happens in air but requires at least 10 days. Nothing happens, 
with or without chlorophyll, if no oxygen is present. The effect of the chloro- 
phyll is greatly to speed the action of oxygen, but it is not a true catalysis 
since a given quantity of chlorophyllin is limited in the amount of odorant 
it can affect, in spite of the fact that its chemical behavior, color, and spectro- 
photometric properties are unchanged. 

In conclusion, the present reviewer wishes to underline a point of view 
implicitly represented in several places above. The time has come in the 
study of chemoreception when it is not enough to present data from two or 
three subjects, obtained in a poorly designed experiment. If researchers are 
to build on each other’s work, each has an obligation to conduct his experi- 
ment in such a way as to make possible a proper statistical estimate of the sig- 
nificance of his findings. Failure to utilize modern techniques of design and 
analysis in experimenting merely adds confusion where an abundance of the 
same already exists. 





The following four articles arrived too late for original inclusion in the 
chapter but will be briefly reviewed here because of their importance. All are 
concerned with the interaction of the olfactory system with other aspects of 
physiology. David et al. (40a) have shown that the female cat’s sexual cycle 
can be altered by electrical stimulation of the olfactory bulbs. The data are 
rather skimpy and irregular, but the relationship is an interesting one. The 
indication is that both the hypophysis and ovary are affected. 

LeMagnen has continued his physiological investigations. In an experi- 
ment with Rapaport (46a) he has found that rats with severe deficiency of 
vitamin A are unable to make odor discriminations which are normally 
possible. The yellow color of the olfactory area has suggested the importance 
of this vitamin in its functioning. LeMagnen has also investigated some of 
the relationships between olfaction and sex in the white rat (46b). He has 
established the fact that rats in different sexual conditions present different 
odors to other rats since the latter respond differentially in a choice situation 
where olfactory cues are the only ones present. The sexual condition of the 
observing rat also affects its performance. Somewhat inconsistent results 
were obtained in experiments designed to study the effect of different sexual 
conditions on ability to discriminate between presence or absence of certain 
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odors in varying concentrations. The last article in this series (46c) is a gen- 
eral review of the effects of olfaction on psychophysiological functions and is 
recommended for its inclusiveness. 
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INDIVIDUAL DIFFERENCES! 


By Jon B. CARROLL 
Graduate School of Education, Harvard University, Cambridge, Massachusetts 


As any reasonably sophisticated reader of this volume will recognize, 
some overlap between this chapter and others is inevitable. This review of 
recent research, however, is oriented around what may be regarded as the 
central problems of the ‘“‘field’’ of individual differences: (a) whether it may 
be said, in any sense, that dimensions of individual differences exist, and if 
so, what these dimensions are; (b) whether these dimensions can be accounted 
for in terms of lower order concepts of psychology, such as learning; and (c) 
how dimensions of individual differences may be most effectively measured. 

The theory and results of factor analysis will be used as a point of de- 
parture, partly because the reviewer finds it interesting to do so, but also be- 
cause it is his conviction that the study of correlation matrices by factor 
analysis is the method of choice in the identification of basic variables. Our 
premises are as follows: tests, ratings, and other measurement procedures 
may overlap considerably with regard to the traits they measure; we can de- 
termine the extent of overlap by noting their intercorrelations; and finally, it 
is the function of factor analysis to summarize the overlaps found in a set of 
measures and to indicate, to the extent possible, mutually nonoverlapping 
clusters of measurements. Any such cluster may then be regarded as measur- 
ing an underlying trait of individual differences. Of course, factor analysis 
usually attempts to do more than this; it describes each test variable as a 
linear function of a certain number of ‘‘common factors.’’ The controversies 
about methods of factor analysis stem largely from the fact that mathemat- 
ically an infinite number of ways of doing this are possible, but if it is remem- 
bered that the basic objective of factor analysis is to identify nonoverlapping 
(or at least, linearly independent) clusters of variables, and that the achieve- 
ment of this objective may necessitate series of factor-analytic experiments, 
many of the controversial points can be resolved (22, 43). The usually justi- 
fiable requirement that the coefficients of the factors in the linear descrip- 
tion of tests be generally zero or positive enables a reasonable choice among 
the infinitely numerous alternative solutions. 

These considerations seem to have been overlooked by Loevinger (73) in 
her examination of the position of factor analysis in psychological theory. 
Despite her able exposition of various factorial procedures, she fails to point 
out adequately the positive contributions that factor analysis can make, 
and gives false impressions regarding some aspects of theory and practice 
in the field. For example, her statement that blind rotation of axes (that is, 


1 The period covered by this review is approximately May, 1952 to May, 1953. 
References to work falling outside this period are in general not listed in the bibliog- 
raphy, but can easily be found in the articles under discussion. 
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rotation performed without a knowledge of the identities of the test vari- 
ables) is ‘“‘neither widely practiced nor widely advocated”’ is simply not true, 
in the reviewer’s experience. Again, in criticizing the hypothesis that ‘‘tests 
can be regarded as precise mathematical functions of factors,’’ she writes: 


. .. tests cannot be considered precise mathematical functions of anything, for as 
psychological tests are now constructed, two people with the same score will in general 
have done different items correctly. Thus the items must measure different abilities, 
and a given total score will represent different sets of abilities for different people. 


In the first place, this statement ignores the well-known theorem that a test 
score can be regarded as composed of a true score and an error score. In 
Lord’s (75) model for items measuring a single underlying ability, it is evi- 
dent that two people getting the same score could have passed different 
items, and this would happen solely by virtue of the fact that a true prob- 
ability of passing an item (which may lie between zero and unity) can only 
be manifested as an a posteriori probability of either precisely zero or pre- 
cisely unity (that is, after the item has actually been passed or failed). The 
difference between the two probabilities is error, and such errors are inevi- 
table, even in the most carefully conducted psychophysical experiments. In 
all psychophysical dimensions known thus far, it is found that in the region 
of the limen the true probability of detecting a stimulus or a stimulus differ- 
ence is less than one and greater than zero. In the second place, Loevinger’s 
argument doubles back on itself. As soon as she mentions “‘different sets of 
abilities’’ she postulates the very things which factor analysis seeks to dis- 
cover. In the case of a test with a complex factorial composition, it is per- 
fectly easy to show that a given score might represent different sets of abili- 
ties, and this is nothing that would alarm a student of factor analysis. 
Loevinger invites factor analysts to demonstrate the status of factors by 
methods completely independent of factor analysis; in reply, one may say 
that if she will accept Mill’s method of concomitant variation as this ‘‘inde- 
pendent method,” this can be done very easily. 

There remain, of course, fundamental questions as to the psychological 
meaning of a “factor,” but these questions are the same ones that can be 
raised about the concept of a psychological ‘‘trait.’’ This is not the place to 
take up this problem; we shall only cite Coombs’ discussion of it (28). Sakoda 
(99) wonders whether factors (or traits) may not more appropriately be iden- 
tified with situations rather than with individuals, but his discussion misses 
the point that nobody has ever expected individuals’ traits to manifest them- 
selves except in various “‘situations” or stimulus constellations. 


DIMENSIONS OF INDIVIDUAL DIFFERENCES 


In some ways, the reviewer regrets that in this section it is impossible to 
list and describe all the factorial dimensions discovered or confirmed in the 
literature under discussion, for these dimensions constitute the chief sub- 
stantive results of these studies. Nevertheless, the interested reader will be 
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well advised to resort to the original articles, in view of the fact that no brief 
phrases that might be used here could suffice to give an adequate idea of the 
nature of these dimensions. 

Dimensions of ability—Zimmerman’s (119) orthogonal reanalysis of 
Thurstone’s original primary mental abilities data (110) is a welcome con- 
tribution, bringing that early study more into line with the results of current 
studies and also with other reanalyses of the Thurstone data. The verbal 
and numerical factors are better defined, and the factorial content of the 
fluency, memory, perception, and reasoning domains has now become con- 
siderably clearer. Perhaps an oblique solution would make the factors even 
more psychologically meaningful. Guilford, Fruchter & Zimmerman (57) re- 
port a factor analysis of 39 experimental tests and 7 reference tests. Seven of 
the 13 factors were clearly identifiable with previous known factors, and 
several relatively new factors were identified. It is of interest that two inde- 
pendent orthogonal solutions were made and reported; despite one or two 
substantial differences between the solutions, one gets the impression that 
they were nearly equivalent with respect to providing a basis for the inter- 
pretation of the factors. 

Some students of factor analysis in Great Britain, led by Burt (20) and 
Vernon (112), have supported a theory of a “hierarchical organization of 
cognitive ability.”” The evidence brought forth by Moursy (83) demands 
serious consideration. Nearly all the currently available analytic procedures 
are applied to the correlations between 20 tests designed to measure four 
types of ability: sensorimotor, perceptual, associative, and relational. By 
the method which Moursy prefers, a ‘‘general cognitive ability” is found in 
all tests of the battery; the tests are then subdivided into those measuring 
“practical ability’ and “intellectual ability”; finally, these groups are still 
further subdivided. The results appear to confirm the initial hypothesis of 
hierarchical organization, and the solution seems to have been unique and 
not subject to arbitrary manipulation. The ‘‘simple structure”’ analysis of- 
fered by Moursy was orthogonal, and not a very good one at that. Moursy 
might have confirmed the hierarchical hypothesis alternatively by an oblique 
simple structure solution containing two correlated second-order factors. 

Guilford et al. (58), Corter (30), and Myers (84) report factor analyses in 
the domain of reasoning abilities. The first of these confirms the existence of 
several factors found in other studies. Corter’s study was especially designed 
to test certain hypotheses as to stages in the process of reasoning, but was 
not particularly successful in this respect. The results of Myers’ study again 
support the proposition that there are both speed and level-of-mastery com- 
ponents in timed tests, as had been shown previously (35), but much more 
research on this problem is needed. 

At least four studies of factors in the perceptual domain appeared during 
the last year. Thurstone’s ‘“‘speed of closure’’ and ‘‘flexibility of closure’’ 
factors are investigated by Pemberton; in the first of her studies (88) evi- 
dence is found that flexibility of closure generalizes to tasks requiring higher 
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cognitive functions, specifically those requiring ‘analytical reasoning’; in 
the second study (89), scores on these closure factors are found to be related 
to temperament variables as measured by certain objective personality in- 
ventories. Persons high on flexibility of closure tend to be socially retiring, 
independent of other’s opinions, and averse to routine, according to this re- 
port. Persons high on speed of closure tend to be sociable, artistic, systematic, 
neat, and possibly “‘intolerant of ambiguity,” in Frenkel-Brunswik’s sense. 
A major contribution to the description of the perceptual domain is made by 
Roff (94), who identifies several factors similar to those in previous studies 
as well as several new factors. Some of the new factors appear only in motion 
picture tests, which were also analyzed by Fruchter & Mahan (50) with re- 
sults somewhat parallel to those of Roff. 

One of the most interesting reports to appear recently concerns tests of 
“creative thinking,’’ by Guilford e¢ ai. (60). A number of hypotheses about 
factors in creative thinking are confirmed, and several extremely intriguing 
factors are disclosed. It would seem that the farther one moves away from 
the academically flavored types of tasks included in the usual intelligence 
tests, the more one finds identifiable factors which may have considerable 
relevance to the study of higher-level mental processes. One awaits the results 
of a similar exploration of ‘evaluative abilities,” now under way by Guilford 
et al. (59). 

Dimensions of individual differences measured by psychomotor tests 
have also received considerable attention. Roff (95) isolates six psychomotor 
factors, most of them new, and Fleishman (46) finds that the same factors 
in the respective trials of two rudder-control tests exist regardless of the 
order in which they are administered. 

Dimensions of personality—New evidence on the factorial structure of 
objective personality inventories is furnished in the extremely instructive 
research of Baehr (9), who made a second-order factor analysis of nine first- 
order factors previously identified by Thurstone in three personality sched- 
ules by Guilford. Two of the four factors which emerged were found to 
correspond to variables postulated in the Heymans-Wiersma conceptual 
scheme of temperament traits. Next, by a direct first-order factor analysis of 
22 behavior items selected from the original inventories in such a way as to 
exclude first-order contrasts, three of the second-order factors previously 
identified were obtained as first-order factors. By showing that a factor can 
appear either in the first-order or the second-order domain, depending upon 
the selection of variables, the study suggests a hierarchical organization of 
personality traits much like that which, as Moursy suggests, exists for 
cognitive abilities. Cattell & Horowitz (25) published another in a series of 
factor studies of personality, this one being concerned with personality 
traits in the domain of “altruism.” 

A study by Carroll (23) shows that to some extent, the five traits pre- 
sumably measured by Guilford’s Inventory of Factors STDCR can be dif- 
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ferentiated not only in the scores of the personality inventory itself but also, 
and even better, in terms of self-ratings and peer-ratings on scales especially 
constructed to correspond to the traits. 

Several studies (not all using factor analysis, however) are concerned with 
the so-called trait of masculinity-femininity. Ford & Tyler (47) find that 
Terman and Miles’ masculinity-femininity test contains at least two dimen- 
sions, for both sexes; one dimension has to do with interests, and the other 
with emotional characteristics. Lee (72) believes that since Bennett’s Me- 
chanical Comprehension Test (MCT) predicts masculinity-femininity scores 
better than it does any criterion of mechanical ability, it can be regarded as 
a nonfakable masculinity-femininity test. One wonders which dimension of 
masculinity-femininity identified by Ford & Tyler is the one correlated with 
the MCT, and what implications Lee’s results have for the factorial inter- 
pretation of the MCT. Gough (55) has constructed a new ‘‘psychological 
femininity” scale, but does not concern himself with the possible lack of uni- 
dimensionality in his scale. 

It is apparent, also, that problems of multiple dimensionality are latent 
in the measure of ‘“‘authoritarianism” proposed by Adorno et av. (1). Hofstaet- 
ter (62) showed that this scale actually measures five factors which are largely 
independent: anti-Negroism, anti-Semitism, national pride, puritanism, 
and state socialism. We may agree wholeheartedly with Hofstaetter’s con- 
clusion that, as he puts it, ‘‘there is no justification for combining these inde- 
pendent dimensions of variability into one type, ‘the authoritarian personal- 


ity’. 

The utility of factor-analytic procedures in the identification of entities 
in abnormal behavior was demonstrated in two studies. Degan (37) re- 
analyzed T. V. Moore’s data on symptoms of manics, depressives, and schizo- 
phrenics. The nine first-order factors which were isolated were regarded as 
syndromes; when these were subjected to a second-order analysis, four fac- 
tors emerged, identified as corresponding to the well-known diagnostic classi- 
fications of mania, hebephrenic schizophrenia, depression, and catatonic 
schizophrenia. Eysenck (42) studied intercorrelations between 20 scores on a 
variety of tests administered to 100 normals, 50 schizophrenics, and 50 
manic depressives. In centroid analyses of the conventional type he found 
that the factor patterns for normals resembled those of psychotics to a con- 
siderable extent. By using the method of “criterion analysis’ which he had 
developed earlier, he claimed to show that schizothymia-cyclothymia does 
not exist as a separate dimension of personality found only in psychotics. 
Thus, the study showed what has been hypothesized for many years: that 
the difference between normality and psychosis is a matter of degree rather 
than kind. Such a conclusion, of course, does not preclude a concern with the 
dynamic processes which take some individuals into mania and others into 
schizophrenia. It would now be interesting, incidentally, to attempt a study 
of normals using the variables which Degan investigated. One wonders, also, 
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whether the battery of tests used by Eysenck was sufficiently diversified to 
enable a univocal test of his hypothesis as to the continuity between nor- 
mality and psychosis. 

Attention should be called to two books on personality study (41, 45), 
both concerned to a considerable extent with problems of identifying indi- 
vidual difference dimensions. 

Dimensions of interests and attitudes—The original Allport-Vernon 
Study of Values test was subjected to a factor analysis by Brogden (17) ina 
study which was more thoroughgoing than an earlier one by Lurie (79). The 
results paralleled Lurie’s quite well and at the same time shed light on the 
proper interpretation of the six evaluative scores originally set forth by All- 
port and Vernon. It is shown that these scores are not factorially pure, but 
are in most cases averages of at least two of the more refined factors isolated 
by Brogden. In all, Brogden was able to identify 10 first-order evaluative 
factors; one interpretable factor in the second-order domain was thought to 
represent the difference between “idealism” and “‘practicality.”” Even though 
not analyzed factorially (possibly because overlaps in the scoring keys and 
the forced-choice feature of the preference test cause a degree of experimental 
dependence not desirable in factor studies), the data presented by Stanley & 
Waldrop (107) concerning intercorrelations of scores on the Study of Values 
and the Kuder Preference Record are of some interest in showing that the 
scales from the two instruments that might logically be expected to corre- 
spond do not actually intercorrelate very highly. In the light of Brogden’s 
analysis of the Study of Values and various studies of the Kuder Preference 
Record, it would seem that the discrepancies are due to differing factorial 
compositions of the scales. 

Strong & Tucker (108) were able to construct a number of relatively in- 
dependent scales for the measurement of interests in various medical special- 
ties. A factor analysis of the scales is presented. One continues to wonder at 
the proliferation of scales on the Strong Interest Inventory in view of the fact 
that it has previously been shown practicable to compute occupational scores 
as linear combinations of a small number of interest factors. Barnett e¢ al. 
(11) found that the occupational level scale of the Strong test is not a measure 
of achievement drive but of the socioeconomic level where the individual 
will be most comfortable. Garry (52) discovered that the ability to fake scores 
on the Strong Interest Blank seems to be an ability which is independent of 
intelligence, sex, or information regarding the occupations involved. Perhaps 
this ability would be found to have some sort of diagnostic significance if 
further investigated. Long (74) developed a Job Preference Survey which is 
designed to be appropriate for placement of unskilled and semiskilled work- 
ers. To judge from their intercorrelations, the six components of work inter- 
est seem to be relatively independent: (a) routine—varied, (b) indoor—out- 
door, (c) hazardous—nonhazardous, (d) sedentary—bodily active, (e) iso- 
lated—gregarious, and (f) precise—approximate. It would be desirable to 
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attempt to tie these interests to a broader framework of personality structure. 
Some of them may be associated with cognitive abilities. 

Dimensions of criterion measures.—In the search for dimensions of individ- 
ual differences, it is useful to investigate what are commonly called ‘‘cri- 
terion measures,”’ that is, measures of performance in various occupational 
pursuits and in various complex tasks. An occupation itself constitutes a situ- 
ation in which the behavior of the individual can be measured, and is thus 
analogous to a psychological test, though of much greater complexity and 
duration. It is likely that occupations, considered as tasks, will tap dimen- 
sions of individual differences which are well-nigh inaccessible to the usual 
psychological testing procedures; this, at least, is suggested by the findings 
of Brown & Ghiselli (18) that in general there is a very low relation between 
the validity of a test in predicting trainability and its validity in predicting 
job performance. To quote these writers, ‘‘The abilities important for learn- 
ing a job may differ markedly from those important in the maintenance of 
proficiency on the job.”’ At the same time, it must be remembered that each 
occupation or complex task presents a distinct situation. A factorial dimen- 
sion emerging in studies of occupational criteria may be specific to the occu- 
pation involved, and since it would be rare to find individuals who could be 
measured in more than one occupation, the cross-identification of factors 
among occupations is somewhat difficult. Nevertheless, the results of facto- 
rial studies of occupational criteria are often useful, even though they may 
leave the investigator unable to devise suitable predictors of these criteria. 
Indeed, even the criterion factors will often seem extremely inconstant and 
elusive. This is illustrated particularly well in a study by Bach (8); despite 
the finding that somewhat the same factors were identified in ratings of prac- 
tice teaching and in ratings of in-service teaching, the relations between pre- 
service and in-service ratings were negligible. 

Newman, French & Bobbitt (85) find three factors observable in perform- 
ance at the United States Coast Guard Academy: adaptability to academy 
life and activities, athletic proficiency and attitude, and academic grades. 
Studies by Ryans (96) and Ryans & Wandt (97) show that the factors emerg- 
ing from ratings of observed teacher behaviors in the elementary school are 
much the same as those in the secondary school. In a factorial investigation 
of measurements applied to industrial trainees Bajard (10) found one factor 
common to his tests and his criterion measurements; two other factors were 
specific to the tests. 

Several mathematical techniques and experimental designs to aid in the 
interpretation of relations between predictors and criteria can be mentioned 
here only briefly. Wrigley (118) has proposed and illustrated a technique 
whereby criterion variables are factor-analyzed and predictor regression 
weights for estimating the criterion factor scores are computed. The proce- 
dure has certain resemblances to canonical correlation. Fruchter (49) has 
considered the problem of describing criterion measures in terms of a set of 
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predictor variables. This problem arose in the United States Air Force, where 
groups of recruits are assigned to different technical courses after their basic 
training. Fruchter’s technique involves locating the criterion measurements 
obtained in each of the technical schools in the space of the predictor matrix. 
In his empirical data, the proportions of criterion variance accounted for by 
predictor variables range from .14 to .93; predictor factors identified as 
verbal comprehension, numerical facility, mechanical experience, and visual- 
ization contribute to the various specialty criteria in a manner which coin- 
cides fairly well with what one might expect in view of the nature of these 
factors and the content of the specialties. For example, the mechanical ex- 
perience factor appeared most strongly in mechanical courses and not at all 
in such specialties as medical corpsman. 

Experimental studies of individual difference dimensions.—The time does 
not seem to have arrived when, as hoped by Thurstone and other pioneers of 
factor analysis, the dimensions of individual differences isolated by this tech- 
nique could be subjected to experimental study of the type which would 
manipulate learning, motivation, and similar independent variables. The 
few investigations which have been of this type seem never to have been con- 
sciously designed with such an end in view; they have stemmed from other 
approaches. Nevertheless, some of these studies can be turned to use in the 
field of individual differences. One example from recent literature may be 
cited. Maltzman & Morrisett (81) traced the effects of different kinds of 
training on the solution of anagrams. Now, anagram tests have frequently 
been used in factor studies of verbal fluency. It is therefore interesting to learn 
that subjects trained with a diversified list of anagrams make significantly 
better progress in solving anagrams in a particular semantic category than 
subjects trained in solving anagrams both of that category and equally of 
another category. The experiment was conducted in order to test certain 
propositions about habit strength. To students of individual differences, 
however, it prompts the hypothesis that verbal fluency as measured by ana- 
grams tests is partly a function of the restricted versus diversified character 
of the incidental environmental training which the subject has received. 

Several other studies of problem solving performances suggest that the 
factor of individual differences which is most often known as the “‘reasoning 
factor” does not need to be regarded merely as a statistical artifact arising 
from the intercorrelations of tests. O’Connor (86), in a correlational study, 
found that “intolerance of ambiguity” is related to poor abstract reasoning 
ability only when it is associated with ethnucentrism. Schroder & Rotter 
(102) discovered that they could experimentally create different degrees of 
rigidity which would differentially affect performance in a problem solving 
task. Thus, they regard rigidity “‘not as a trait or entity but as a kind of be- 
havior predictable from specific learning experiences.”” Cowen, Wiener & 
Hess (31) obtained a correlation of .42 between rigidity scores on Luchins’ 
water-jar task and a specially constructed ‘‘alphabet-maze” designed to 
measure a different aspect of rigidity in behavior. Thus, rigidity now seems 
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to be a fairly general source of individual differences in performing certain 
kinds of tasks, particularly those involving reasoning. It is of some interest, 
incidentally, that Cowen et al. report U-shaped distributions for their meas- 
ures. Such distributions are usually hard to come by. 

Factorial interpretations of certain well-known tests—One may applaud 
the trend toward investigations of the factorial composition of certain widely 
used clinical tests. Ansbacher (6) correlated scores on the Goodenough Draw- 
a-Man test with scores on Primary Mental Abilities tests and certain of the 
MacQuarrie tests. Higher correlations were found for Reasoning (.40), 
Space (.38), and Perception (.37) than for Verbal Meaning (.26) and Number 
(.24). Since the test correlated to the extent of .34 with the MacQuarrie trac- 
ing subtest, Ansbacher suggested that the Draw-a-Man test also measures 
“youthful willingness to follow directions.’”’ Unfortunately, one cannot tell 
from this investigation whether the connections between the Draw-a-Man 
test and Primary Mental Abilities tests are due to group factors or to a gen- 
eral factor of intelligence. 

The Wechsler-Bellevue test also came under scrutiny from the standpoint 
of a factor-analytic rationale. In the first of two papers, Cohen (26) found 
that the test can be resolved into three common factors (connected by a sec- 
ond-order general factor) which he called ‘‘verbal,’’ ‘‘non-verbal organiza- 
tion,” and “freedom from distractability” respectively, and that the facto- 
rial composition of the subtests is in general highly consistent whether one 
tests psychoneurotics, schizophrenics, or brain-damaged cases. In the subse- 
quent paper (27), Cohen proposed a “‘factor-analytically based rationale for 
the Wechsler-Bellevue.’’ Disagreeing with the test rationales offered by 


_ Wechsler and by Rapaport, he finds that test performance in patient popu- 


lations can be interpreted readily in terms of his common factors. Little evi- 
dence is found for test specificity, as would be required by the Wechsler and 
Rapaport rationales. On the other hand, some of the Wechsler-Bellevue sub- 
tests seem to operate differently in different neuropsychiatric groups; thus, 
interpretation of the tests is partly dependent on the diagnosis. Cohen’s work 
has a strong appeal but suffers somewhat from the fact that only the sub- 
tests of the Wechsler-Bellevue were studied. No “‘reference’’ tests of pre- 
viously established factors were included in the analysis. 


VARIABLES ASSOCIATED WITH INDIVIDUAL DIFFERENCES 


In the studies to be discussed below, we shall consider some constitutional 
and environmental variables which may account for individual differences. 
As has been true of similar research in the past, most of these investigations 
are limited by the fact that they are not sufficiently discriminating with re- 
spect to the variables of individual differences which are studied. When the 
dependent variable is a complex hodgepodge like ‘“‘general intelligence,” we 
cannot draw any conclusions as to which aspects of individual differences are 
associated with a given genetic or environmental vraiable. 

The stability of individual differences——Before investigating variables 
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associated with individual differences we may well look to the stability of 
these differences. Very little has been contributed to the literature on this 
problem recently. The study by Husén (65) merely confirms existing evi- 
dence that intelligence test scores are stable for adults over periods of a year, 
and that the correlations obtained will depend upon the extent to which the 
tests are equivalent. Though based on only six cases, a study by Smith (106) 
is interesting because it is perhaps the first longitudinal study covering a 50- 
year period. A notebook kept by a mother on her six children for eight years 
furnished the data for this study; the life histories of these individuals 50 
years later were also available. Considerable consistency was found in such 
areas as health, learning ability, personality, and interests. 

Genetic and physiological variables.—In an elaborate statistical treatment 
of certain data from the so-called Scottish Survey, Roberts (92) claimed to 
find further evidence for Pearson’s hypothesis of two types of mental defi- 
ciency: the type which is found in the lower tail of the normal curve and 
which may be described in terms of multi-factorial genetics, and the ab- 
normal, qualitatively different type ascribed to the operation of a single gene. 
Nevertheless, it is probable that both these types would be regarded as endog- 
enous by Feldman (44), who found that exogenous mental defectives do 
better on the Bender Gestalt tests. Biesheuvel (13) summarizes the argu- 
ments against the use of IQ tests in studies of heredity and environment or 
in studies concerned with a putative decline in the intelligence of the popula- 
tion. He sees no evidence that the national intelligence in South Africa is de- 
clining, and confirms certain indications of the Scottish Survey on family 
size and intelligence. Short (104) finds reason to believe that imagery types 
(visual versus verbal) may be of an inherited character, since they are asso- 
ciated with alpha rhythm, also known to be hereditary. 

In view of the tremendous difficulties entailed in establishing any asso- 
ciation between ethnicity and individual differences, current research on this 
problem seems limited to small scale descriptive studies. Anastasi & D’An- 
gelo (4) examined Negro-white differences in language development and 
Goodenough Draw-a-Man IQ. Ina sample of 100 preschool children, no dif- 
ferences in IQ were significant; in language development, the only significant 
result was a race Xsex interaction wherein Negro boys developed faster than 
girls, while the opposite tendency (consistent with the results of previous 
research) was noticed among the white children. Perhaps this result could be 
traced to environmental and cultural factors. 

There is, however, increased interest in the effect of aging on individual 
differences. Vincent (113) finds that the decline of mean score on various 
intelligence tests with age can be regarded as practically constant, that this 
decline operates for both timed and untimed tests (provided they are in- 
fluenced only to a minor extent by knowledge), and that the mean annual 
decline from age 21 to 60 is approximately .03¢. Welford (115) presents ex- 
periments on the effect of aging on various types of skilled performances. He 
ascribes the resulting decline to failures of central receptor mechanisms. A fac- 
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tor analysis of the Wechsler-Bellevue test by Birren (15) is interesting be- 
cause its results compare very favorably with those of Cohen (26), previ- 
ously cited, for three neuropsychiatric groups. Birren finds four oblique fac- 
tors, three of which were identified as Verbal Comprehension, Closure, and 
Rote Memory. Comparison of the patterns of factor loadings suggests that 
these are the same, respectively, as Cohen’s Verbal, Nonverbal Organization, 
and Freedom from Distractability factors. Thus, Cohen and Birren differ 
only in the subjective interpretations which they give the factors, which are 
in any case highly similar. Since the two studies appear to have been per- 
formed quite independently, and even on different types of populations, the 
results constitute fresh and interesting evidence for the invariance of factor 
analytic results. Birren notes, incidentally, that his elderly sample shows loss 
on all three factors, but greatest relative loss on the closure factor. 

If the various types of ability decline differentially, it has been reasoned, 
comparisons of losses on different factors should result in a measure of mental 
impairment. It seems that investigators in this field have not adequately 
taken into account a factor-analytic rationale. Two recent research papers 
on this topic seem to do little more than illustrate the complexity of the 
problem. Garfield & Blek (51) emphasize that there is an interaction between 
age and mental impairment that must be taken account of. Corsini & Fassett 
(29) offer a substitute for Wechsler’s mental deterioration index which works 
better even when applied to Wechsler’s own data. 

If the data had been analyzed more thoroughly and rigorously, the results 
presented by Kurland & Gilgash (71) might have thrown further light on 
the operation of glutamic acid dosage with mental defectives. Administration 
of this drug for a month apparently produced a significant rise on the Wechs- 
ler-Bellevue test, but the effect seems to have been more marked on the per- 
formance scales than on the verbal scales. 

Environmental variables—While it is too early to expect follow-ups of 
the work of Eells et al. (39), several exploratory studies on socioeconomic 
status variables associated with intelligence tests have appeared. Estes (40) 
found status differences at age 714. A more careful look at some of Estes’ re- 
sults, however, shows that significant socioeconomic differences on certain 
scales (particularly the “‘verbal” scales) seem to be present even at age 10%. 
These scales are of course arbitrary combinations of scores on separate test 
performances, and not factor-analytically established scales. The differences 
associated with such a variable as socioeconomic status can be made mean- 
ingful only if they are in terms of linearly independent variables. Tate (109) 
compared groups of five-year-olds, contrasted in socioeconomic status, on 
the Leiter International Performance Scale and certain other tests of intelli- 
gence. The conclusion of the author that the Leiter International Perform- 
ance Scale is no more culture-free than the Stanford-Binet or the Arthur 
Performance Scale seems fairly well supported by her data. In the area of 
personality testing, Auld (7) reviewed the by now substantial amount of evi- 
dence that social class is associated with responses on various personality 
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tests, both objective and projective. Auld concludes that the psychologist 
must take these social class differences into account; otherwise, says Auld, 
the psychologist is “likely to assume a bias in favor of his own social class, 
the middle class.” 

Socioeconomic status is a gross variable which masks the specific influ- 
ences of various kinds of experience. Fortunately, at least a few studies of the 
effects of specific training appeared in the last year. A study by Klatskin 
(69) suggests that the currently popular “lenient” child-care procedures 
make for a slight rise in scores on infant intelligence tests. This sort of thing 
happens with laboratory rats, too, to judge from researches reported by 
Bingham & Griffiths (14) and Forgays & Forgays (48), who found that rats 
raised in infancy in ‘“‘rich’’ environments learn mazes better as adults; never- 
theless, these workers have not yet been able to pin down exactly what 
makes an environment “rich” for a newborn rat. Boger (16) conducted an 
experimental study of the effect of special exercises with blocks, geometric 
problems, and puzzles on the intelligence test scores of elementary pupils in 
rural schools. Experimental groups were significantly better than control 
groups after 3} months’ training, particularly on the spatial relationship 
part of the California Test of Mental Maturity. This finding suggests tenta- 
tively that the space factor, as it is usually identified, may be attributable 
partly to differences in experience with the tasks used in measuring this fac- 
tor. There are, in fact, disappointingly few studies to date concerning the 
effects of specific kinds of training upon the factorial compositions of tasks. 
One recent investigation from France, by Goguelin (54), is of interest despite 
the fact that the results cannot be neatly integrated with those obtained 
from samples in the United States. Goguelin made factor analyses of the 
same seven subtests of a mechanical comprehension battery for four groups 
with different amounts and kinds of technical training. The differences in 
the factor loadings are explained on the hypothesis that some tests were per- 
formed by technically trained people primarily on the basis of their tech- 
nical knowledge, but by nontechnical people primarily on the basis of what 
Goguelin calls “‘a more or less intuitive reasoning ability.” 

Perhaps because of recent world events and movements of peoples, there 
is a continuing interest in the effect of bilingualism on measured intelligence. 
Nothing in the extensive review by Darcy (34) of 110 references in the liter- 
ature would tend to set aside the hypothesis that bilingualism depresses 
scores on certain intelligence tests simply because the bilingual child (bi- 
lingual in the sense of using two languages, but not necessarily with equal 
competence) is not likely to have as good a mastery of the language of testing 
as a comparable unilingual child. Darcy herself concludes that bilinguals 
suffer from a language handicap in taking verbal tests, but that there is no 
indication of inferiority of bilinguals on nonlanguage tests, provided ade- 
quate controls have been exercised. Recent papers by Darcy (33), Anastasi 
& Cordova (3), and Johnson (66) show results consistent with these conclu- 
sions. Jones (67) found that in the case of Welsh-English bilingual children, 
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the difference between nonverbal and verbal IQ tends to diminish as reading 
age in English increases. 

There are several studies which deal with certain effects of the measure- 
ment procedure upon intelligence test scores. Peel (87) examined practice 
effects on Moray House intelligence tests; Sacks (98) found that the exam- 
iner’s establishment of pleasant social relations with three-year-old nursery 
school children makes for significantly higher tested I1Q’s; and Sarason & 
Mandler (101) showed that individuals with a high anxiety drive in the test- 
ing situation tend to perform more poorly than otherwise on group intelli- 
gence, aptitude, and achievement tests. If the Sarason-Mandler effect is a 
highly general one, it may mean that test scores ought to be adjusted for 
anxiety drive, both in research and practice. This is a problem which will 
doubtless receive more study in the near future. 


PsYCHOMETRIC METHODOLOGY 


Under the heading of psychometric methodology, the predominantly 
important topics are (a) techniques of identifying dimensions of individual 
differences, (b) problems involved in constructing measurements, and (c) the 
appropriate scaling of these measurements. These topics will be discussed 
in inverse order, however, because recent thought has emphasized the prob- 
lems inherent in measuring a unidimensional trait. 

Scaling theory.—An important contribution to the theory of all psycho- 
logical scaling (including both psychophysical and psychometric) is the 
monograph by Coombs (28), who desires to construct a mathematical model 
to fit behavior rather than to attempt to fit behavior to a preconceived 
model. Implied in this effort is the rejection of the numerical assumptions 
latent in many previous theories of scaling, with the result that in Coombs’ 
opinion scaling must be distinguished from measurement, which implies 
number. Coombs’ model concerns various procedures for seeking relations 
between individuals and stimuli. Values for individuals and for stimuli are 
found on scales which yield orders of magnitude rather than numerical 
quantities. Of particular interest in the present connection is Coombs’ 
mathematical definition of a perfect trait (perfect in the sense that it orders 
both individuals and stimuli in a corresponding way). Thus far, Coombs has 
not published any material explicitly applying his approach to mental test 
data, but such an application seems clearly possible. 

Mathematical theory of ability tests —The theory of mental tests has taken 
substantial forward strides in the hands of Lord (75), who shows that when 
the items of a test are measures of a single common factor of ability, the char- 
acteristics of the test score distribution can be predicted with high accuracy 
from the parameters relating to the difficulty and the validity of the items. 
He has confirmed with admirable mathematical rigor the hypothesis of sev- 
eral previous workers that the regression of test scores on true ability is cur- 
vilinear in the general case. His formulations also enable the prediction of 
bivariate frequency distributions between tests measuring a single ability 
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but composed of items with any specified difficulty values and validities. 

Further, it is found that the relations between item parameters and the 
reliability and discriminating power of a test can be precisely specified. These 
results are all for free response tests, that is, where the examinee has no op- 
portunity to pass an item by guessing. Later, however, Lord published two 
papers relating to multiple choice tests (76, 77), the second of which also 
presents formulae for maximum likelihood estimates of true scores. Cron- 
bach & Warrington (32) have come to several conclusions identical to Lord’s 
but chiefly by empirical rather than by mathematical investigation. Plumlee 
(91) developed formulae to show the effect of item difficulty and chance suc- 
cess on product moment item correlations and on test reliability. Finally, a 
mathematical investigation by Angoff (5) and an empirical study by Wes- 
man & Kernan (116) concern the problem of estimating the unspeeded reli- 
ability for time-limit tests. Many of these studies are of fundamental signifi- 
cance; unfortunately, their results are of too great complexity for further 
description here. It would appear, at any rate, that a theory of mental tests 
which employs orthodox quantitative concepts is of great practical value. 
Perhaps this is another way of saying that some of the quantitative assump- 
tions to which Coombs would take exception actually fit the data quite well. 

Other problems in test construction—Anastasi (2) showed sequential sam- 
pling of cases to be an efficient and accurate method of determining item 
parameters such as difficulty and validity. Kuang (70) compared three meth- 
ods of item analysis: F. B. Davis’s procedure, biserial r, and probit analysis. 
It was found that they produce tests about equal in reliability, but that they 
differ in the computational labor involved, the Davis technique being the 
fastest and probit analysis the slowest. Lorge & Kruglov (78) concerned 
themselves with the problem which arises when one wishes subject matter 
experts to estimate item difficulties in advance of any tryout of items. They 
found that estimates could be made more accurately when the judges are 
given anchoring information consisting of the difficulty values of representa- 
tive items tried out previously. 

Factor analysis methodology.—Guttman (61) clarifies certain relations be- 
tween Thurstone’s, Holzinger’s, and his own approaches to the computation 
of multiple group factors. In a critical review of statistical significance tests 
in factor analysis, Burt (21) summarizes several methods which may not be 
familiar to American investigators. He is in a generous frame of mind on the 
question of ‘‘when to stop factoring,’’ but more severe when it comes to ‘‘de- 
monstrating”’ factors. He is properly critical of the practice of testing the 
significance of residuals against the sampling distributions of zero correla- 
tions. His idea of testing the significance of the difference between observed 
and reproduced correlations further suggests a possible means of testing the 
significance of a factor loading even in an oblique reference frame, for possibly 
one could compare the observed reproduced correlation with the reproduced 
correlation under the hypothesis of a zero factor loading. 

Several solutions have been offered for finding the orthogonal structure 
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which best fits a given oblique structure (53, 56). Sandler (100) suggests 
that the rotation of factor axes can be facilitated by selecting certain persons 
as extreme deviates in the factors represented and, by Burt’s reciprocity 
principle, inserting these persons into the test space. Sandler does not, how- 
ever, give any convincing demonstration that his method would be useful 
when one does not know what the factors are in advance of the analysis. 
Carroll (24) has recently offered a method for obtaining an approximation to 
simple structure by completely objective techniques rather than by the usual 
graphical procedures; unfortunately, the method presently requires almost 
prohibitive computations. 

Other techniques and problems.—Much interest has been evinced lately in 
the multiple discriminant function, designed to give a best-fitting set of 
weights for tests in descriminating between k groups. Tiedeman & Stern- 
berg (111) have provided an illustration of the possibilities of this technique 
in curriculum guidance in secondary schools, but Webster’s (114) paper sug- 
gests that the technique may also be useful in the investigation of individual 
differences as such. For example, in his study the technique disclosed a result 
that otherwise might not have been apparent, namely, that certain Thematic 
Apperception Test variables significantly discriminated ‘dominant’? from 
“submissive”? men only among those who were ‘‘secure.”’ 

Sidman (105) cautioned against the rather common practice of educing 
functional relations from group data when these are actually averages from 
individual curves. Averaging, he pointed out, may mask the possibly differ- 
ing forms and parameters of individual curves. 


PsYCHOMETRIC INSTRUMENTS 


Attention should be called to the publication of Buros’s Fourth Menta! 
Measurements Yearbook (19). Buros has performed a distinct service for all 
persons concerned with tests in continuing to provide a comprehensive list- 
ing and review of currently available instruments. 

New tests —Davis & Eells (36) brought out a series of tests for primary 
and elementary school children which they presume to be culture-fair for 
children of different socioeconomic levels. Two attempts have been made to 
simplify the Vigotsky test for clinical use (90, 103). These researches fail, 
however, to clarify what the Vigotsky test measures. Apparently, it does not 
measure ‘‘intelligence’’ in the usual sense. 

Work has continued on the development of abbreviated scales of intelli- 
gence. Hunt & French (64) offer extended norms for their so-called CVS 
Abbreviated Intelligence Scale and provide evidence of its validity for 
rough screening on intelligence and for certain kinds of psychiatric classifi- 
cation. 

Standardization and evaluation of tests——Numerous investigations con- 
cerning the standardization of intelligence tests continue to appear. Repre- 
sentative of these are: Roberts & Mellone’s (93) study of the comparability 
of Terman-Merrill IQ’s at different ages; Justman & Wrightstone’s (68) study 
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of the Pintner and the Henmon-Nelson tests; Holland’s (63) comparison, 
for normal children, of the Wechsler Intelligence Scale for Children (WISC) 
and the Stanford-Binet; Delattre & Cole’s (38) comparison of the WISC and 
the Wechsler-Bellevue, again for normal children; and Beverly & Bensburg’s 
(12) comparison of the Leiter, Cornell-Coxe, and Stanford-Binet tests as 
applied to mental defectives. Typically, these studies report differences in 
mean level of IQ or mental age obtained with different tests; in view of the 
high correlations usually reported (which are not at all surprising in view of 
the generally common factorial content of the tests), it would perhaps be 
more useful if these writers were to report, in addition, detailed tables of 
score equivalences. 

Millard (82) and Wickert (117) agree in finding that despite its title, the 
test known as How Supervise?, designed for measuring knowledge of fore- 
manship and industrial supervision, functions essentially as an intelligence 
test, at least for nonhigh school graduates. If it measures knowledge of prin- 
ciples of supervision at all, it does so only for highly educated groups. These 
results may be partly accounted for by Maloney’s (80) finding that the test 
is at the “difficult” level of readability. It may be commented that the 
readability of a test, or of the test instructions, is probably often a good indi- 
cation of the extent to which it measures the verbal comprehension factor of 
ability. 
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Department of Psychology, Yale University, New Haven, Connecticut 


In agreement with Sears (37), the writer feels that the subject matter of | 
personality study may usefully be classified into three general categories: | 
personality structure, psychodynamics, and personality development. These | 
categories are not rigidly separated, but are useful labels for certain cluster- 
ings either of theoretical interest, of research methods, or of type of evidence 
sought for. Within each of these categories, of course, there is no sharp bound- 
ary between general study of personality and such specific interests, covered 
elsewhere in this volume, as the investigation of child behavior, abnormal 
behavior, and techniques of assessment and diagnosis. The writer has no 
interest in helping to preserve the integrity of personality study as an isolated 
entity, welcoming instead the fact that as general psychology becomes more 
adequate to deal with the whole range of human behavior there is ever less 
occasion to recognize personality study in the traditional sense as a discipline 
in any way distinct. The general approach adopted in planning this chapter, 
has been that of concentrating on those topics in personality study that seem 
least likely to receive extended attention in other chapters. Within the rather 
restricted yet heterogeneous field that is delimited in this way, the writer 
has none the less been highly selective. 

One topic, psychodynamics and performance, will receive extended at- 
tention just because it happens to be especially conspicuous in the research 
studies published during the past year. Several other topics in psychody-' 
namics will then be treated more briefly. Illustrative of the lack of rigid sepa- 
ration among the three categories of personality study mentioned above is 
the fact that much of the research reviewed under these topics might, with 
a slight shift in theoretical interest, be classified under personality develop- 
ment or, if attention were focussed on the kind of evidence appealed to, be 
classified under personality structure. 

Within the delimited field reviewed here, there has been so little research 
during the past year on personality development that this category will - 
omitted from separate consideration. There remains the study of personality 
structure. Here the reviewer has been highly selective in a way that is not 
easily communicated; he has simply picked out for special attention certain 
research which seemed to him, for diverse reasons, especially likely to be of 
great general interest. 


1 The survey of the literature pertaining to this review was completed in May, 
1953. 


2 The following abbreviations were used in this review: MMPI (Minnesota Multi- 
phasic Personality Inventory); US (unconditioned stimulus). 
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PsYCHODYNAMICS AND PERFORMANCE 


The past year has seen a remarkable burst of research publication on 
psychodynamic influences on intellectual performance; it is remarkable, in 
part, for the extent to which various scholars working independently of each 
other but under the common influence of the general state of the science, 
have formulated similar hypotheses and tested them in similar fashion. Ear- 
lier work has been reviewed, and theoretical integration has been offered, 
by Lazarus, Deese & Osler (29) and by Child & Waterhouse (13, 14) and 
these efforts are being brought more up to date by Deese & Lazarus (16). 
In view of this fact, what seems most appropriate here is a review of the 
developments that seem most distinctive during the past year. As it happens, 
these developments are particularly appropriate in a review on personality, 
for they are concerned with personality variables in relation to the psycho- 
dynamics of performance. In particular, three separate measures of per- 
sonality variables, apparently developed quite independently, have been 
used in these studies. 

General scale of manifest anxiety.—The first of these variables is the meas- 
ure developed and most fully reported by Taylor (43). It is based on a ques- 
tionnaire which consists of anxiety items taken from the Minnesota Multi- 
phasic Personality Inventory, together with buffer items. Those items were 
first selected which four out of five clinicians agreed to be indicative of mani- 
fest anxiety, but about a quarter of the items were later eliminated for lack 
of high correlation with total score. On the basis of the way the scale was 
constructed, it has been widely taken as a measure of manifest anxiety. 
Recently evidence has become available which may have a direct bearing 
on its validity, though the evidence certainly does not lead to any unambigu- 
ous conclusion. Taylor (43) reports a very large difference in average score 
on this scale between normal subjects and psychiatric patients, and suggests 
that this may constitute some positive evidence for the validity of the scale. 
Holtzman, Calvin & Bitterman (27) report very high correlations between 
the Taylor scale and a scale developed by Winne (49) from the MMPI? by 
the criterion of high discrimination between normals and neurotics, and they 
too suggest that this is positive evidence for the validity of the Taylor scale. 
Deese, Lazarus & Keenan (17), on the other hand, confirm the latter finding 
and also report a high correlation between the Taylor scale and a scale of 
psychasthenia (largely a result of overlapping items) but feel that these re- 
lationships cast some doubt on the simple view of what the Taylor scale 
measures which is represented in most of the publications making use of it. 
Actually the most pertinent evidence on validity, in relation to the purposes 
for which the scale has been used, may be that provided by an incidental 
finding of a study by Bitterman & Holtzman (4). Thirty-seven subjects 
were divided into high and low anxiety categories by a conference of four 
raters, who made use of various MMPI scores, the Rorschach, and informa- 
tion about response to experimental stress. The relation of this dichotomy to 
one based on the Taylor scale alone was not significant. These various facts 
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suggest the difficulties that may generally be encountered when a complex 
personality scale comes to be used not for some practical aim for which dis- 
crimination may be useful even if the basis of discrimination is not under- 
stood, but instead for theoretical purposes which require relating scores to a 
single theoretically defined intervening variable. For the other personality 
measures which will be reviewed later, there is no parallel evidence to com- 
plicate the picture of their possible meaning, but this is doubtless just be- 
cause inquiry into their validity has not been pushed as far. 

This scale of general manifest anxiety was first used by Taylor (42) in a 
study in which extreme groups, very high or very low on the scale, were com- 
pared with respect to eyeblink conditioning. She found significantly more} 
conditioned responses in the high anxiety group and interpreted this as an 
outcome of higher drive level in this group. (She pointed out that the effect 
of drive might be either directly on performance or by way of an influence on 
the habit-strength increment resulting from each rewarded response, or both 
together.) This finding was confirmed by Spence & Taylor (40), separately 
for two intensities of the air puff which served as US,? and by Spence & 
Farber (39); the latter authors also showed that the higher performance level 
of the high anxiety subjects was maintained during extinction. A study by 
Hilgard, Jones & Kaplan (26), using unselected subjects from the whole nor- 
mal range, instead of preselected extreme groups, confirmed the relation be- 
tween conditioning and anxiety score only as to direction, with results which 
are far short of being significant. However, one study by Spence & Taylor 
(41), which used an unselected group, did significantly confirm the finding 
which had otherwise been based on extreme groups alone. The evidence 
adds up to a perfectly clear conclusion at the empirical level. In eyeblink 
conditioning, performance level is positively correlated with score on the) 
Taylor scale. It seems likely, moreover, that this is a more or less continuous 
relationship throughout the normal range (41), though Spence & Taylor (41) 
mention unpublished studies which indicate that the relationship may be 
curvilinear rather than rectilinear. The finding of faster conditioning with 
high anxiety is, it should be added, consistent with earlier studies of condi- 
tioning rate as a function of pathological anxiety [see Welch (46)]. That 
the finding has some generality, at least for conditioned responses based on 
anxiety-arousing unconditioned responses, is suggested by the fact that these 
earlier studies have used stimuli and responses different from those used by 
the Iowa group, as has a recent study on normal subjects by Bitterman & 
Holtzman (4). 

From these findings about simple conditioning, the Iowa studies go on« 
to consider the possible effects of anxiety as a variable in more complex 
learning situations. The principal lines of theoretical reasoning used here 
are those presented by Taylor & Spence (44) and repeated by Farber & 
Spence (24). The theoretical distinction they make among various learning. 
tasks has to do with the number of competing response tendencies aroused. 
In simple conditioning, the situation is so controlled that a single stimulus- 
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response tendency is aroused, with no appreciable competition; here the ef- - 
fect of increased anxiety is simply to improve performance through increas- 
ing the drive which, by Hullian theory, enters as a multiplier in determining 
the strength of this single S-R tendency. But in a more complicated learning 
task, such as serial rote learning, there are likely to be several competing 
tendencies present at any one point, and heightened drive will, according to 
Hullian postulates, have a multiplicative effect upon the strength of all 
these tendencies. If the correct response is not the one with greatest habit 
strength, the result will be to increase the advantage in response strength of 
those incorrect responses which are above it in the response hierarchy. Thus 
the probability of evocation of the correct response will be lowered, and the 
persistence of the dominant incorrect responses will be more protracted<so 
that during this period performance of subjects with high anxiety will be 
inferior to that of subjects with low anxiety. This much seems to be a thor- 
oughly sound and direct derivation from Hullian theory. But at this point 
the theoretical reasoning of Taylor & Spence (44) becomes, in the opinion of 
the reviewer, rather careless. For they also suggest that if the correct re- 
sponse is the one of greatest strength, then the responses lower in the hier- 
archy will have a greater probability of being evoked in the presence of high 
anxiety than in the presence of low anxiety! Since they mention, in this con- 
nection, response threshold and behavioral oscillation, which they happen 
not to mention in connection with the preceding argument, the two argu- 
ments appear, at first glance, not to be contradictory. In the opinion of the 
reviewer, however, the two arguments as presented are contradictory, and 
only the first one seems to represent a correct derivation from the postulates 
employed. The same view is perhaps implied in a paper by Montague (33), 
for in a similar theoretical discussion he employs only the first of these two 
arguments. 

By whatever mode of reasoning, then, Taylor & Spence predict that in 
situations where numerous incorrect response tendencies are aroused high 
anxiety should make for poorer performance than low anxiety; this effect is 
just the opposite of that found in simple conditioning. The available evi- 
dence will be reviewed in relation to this prediction before returning to the 
problem of theoretical interpretation. Several studies in the Iowa series re- 
port the relative performance of high anxiety and low anxiety subjects on 
learning tasks where incorrect response tendencies are likely to be conspicu- 
ous. Taylor & Spence (44), using a sort of verbal multiple-T maze, found high 
anxiety subjects to be significantly inferior to low anxiety subjects. Lucas 
(30) found high anxiety subjects to be inferior in immediate memory for con- 
sonant lists which contained confusing duplications. Farber & Spence (24) 
obtained the same result with a stylus maze. These latter authors, moreover, 
had employed some of the same subjects on the stylus maze and in a condi- 
tioning study. Normalizing the scores on each task to render them compa- 
rable, they were able to make a direct comparison of performance by the 
same high and low anxiety subjects in the two kinds of learning situations. 
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There is a clear interaction, with high anxiety subjects being inferior to low 
anxiety subjects in maze learning and superior to them in simple condition- 
ing. 

The theoretical considerations from which this gross variation of the 
effect of anxiety according to type of task was predicted would, of course, 
also lead to a prediction of quantitative variation of the effect according 
to the extent to which strong competing response tendencies are aroused. 
Two approaches have been used in an attempt to verify this prediction. 
One is to investigate the magnitude of the anxiety effect as a function of 
the difficulty of particular choice points in the maze, on the obvious as- 
sumption that greater difficulty implies more successful competition from 
incorrect response tendencies. Taylor & Spence (44) and Farber & Spence 
(24) both report a high correlation between difficulty of choice point and 
amount of difference between the two groups of subjects in performance on 
that choice point, and regard this as some confirmation of the theoretical 
prediction. To the reviewer this evidence carries no conviction, for it seems 
too likely to be a mere artifact of the system of measurement employed. The 
range of difficulty, at least in the data of Farber & Spence’s study, is from a 
low percentage of errors up to something in the vicinity of 50 per cent of 
errors, and it seems likely that an effect essentially constant in nature would 
vary in magnitude through this range in just about the observed manner; 
this would be true, for example, of the effects of sampling error. (At the same 
time, both these pairs of authors note that anxiety has a detrimental effect 
even on easy choice points to an extent that is perhaps incompatible with 
their assumptions alone; in the opinion of the reviewer, this finding suggests 
the influence of other competing responses made specifically to the anxiety, 
such as will be mentioned again later.) More convincing evidence is actually 
provided by a finding reported by Farber & Spence in a slightly different 
connection. When high anxiety and low anxiety subjects are matched for 
total score on the stylus maze, significant evidence (at the 5 per cent level) 
is found that the high anxiety subjects perform better than the low anxiety 
subjects on the easiest choice points, while performing worse on the most 
difficult choice points. 

The second approach to this problem of quantitative variation of the ef- 
fect with amount of response competition consists of experimental intro- 
duction of variables which should directly influence amount of response com- 
petition. In an experiment by Lucas (30) the experimental variable intro- 
duced for this purpose was the extent to which, in a list of consonants pre- 
sented in an immediate memory task, some of the consonants were repeated 
in differing locations within the list. Significant evidence was found that with 
either two or five such duplications, the inferiority in performance of a high 
anxiety group was greater than with no duplications. Montague (33), in a 
study of serial rote learning, introduced two experimental variables for this 
purpose, intra-list similarity and association value of the nonsense syllables 
used. Some evidence was obtained that both high intra-list similarity and low 
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association value, presumably both making for relatively strong competing 
responses, tend to produce a decrement in performance in high anxiety sub- 
jects in comparison with low anxiety subjects. 

These various lines of evidence do add up to a convincing demonstration 
that as the task becomes more complex (in the sense of involving conflict 
among various response tendencies) there is a tendency for high anxiety sub- 
jects to show increasingly poor performance in comparison with low anxiety 
subjects. If one shares the view of Taylor & Spence (44) that this relation- 
ship would be expected from a simple application of Hullian theory regard- 
less of whether the correct response is highest in the response hierarchy, then 
this conclusion tends to confirm this simple application of Hullian theory. If, 
with the reviewer, one feels that their prediction here is faulty, then the 
theoretical implication of these findings is not so clear, for the simple ap- 
plication of Hullian theory would lead to this prediction only in those cases 
where the correct response is not at any given point dominant in the hier- 
archy. In this event, there is no clear theoretical basis for predicting the over- 
all direction of effect of anxiety on performance in a complex task. To verify 
this interpretation one would need a trial-by-trial analysis to determine 
whether high anxiety subjects show greater stability of both their correct 
and their incorrect responses than do low anxiety subjects. One can certainly 
agree with Taylor & Spence (44, p. 64) that “further experimentation needs 
to be carried out in learning situations in which the number and relative 
strengths of the responses competing with the correct one can be controlled 
and manipulated.”’ But at the present stage of knowledge, the reviewer is of 
the opinion that the most plausible general interpretation of these findings 
about task complexity is that the disruptive effects of various responses to 
anxiety vary with the nature of the task: that in simple conditioning, where a 
stable relationship is established between a single stimulus and a single re- 
sponse, what internal responses the subject is making at the time do not 
have any great effect, whereas the presence of high drive level does make for 
heightened performance; but that in complex situations, where the subject 
is already in conflict between various response tendencies relevant to the task, 
the presence of irrelevant responses made to anxiety heightens the conflict 
and interferes with performance to a greater extent than the increased drive 
improves it. In those specific instances where the correct response tendency 
is not the strongest of the task-relevant response tendencies, of course, the 
simpler sequence of events outlined by Taylor & Spence should be operating 
also to further strengthen this effect. 

This line of interpretation is recognized both by Montague (33) and by 
Lucas (30), though it is made use of only by Lucas. In the reviewer’s opinion, 
it is an essential line of interpretation (for reasons which will be indicated in 
dealing with personality variables other than the Taylor scale). The Iowa 
studies seem to have missed its importance thus far for two reasons. The first 
reason has to do with the nonexperimental nature of the Iowa studies. In 
view of the origin of these studies in a strongly experimental tradition, it isa 
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curious fact that with the single exception of the Lucas study they have made 
no attempt to vary anxiety experimentally but have been content to deal 
with anxiety exclusively as a preexisting personality variable. The Lucas 
study illustrates what is also shown in other studies to be reviewed below, 
that when anxiety is simultaneously varied experimentally and personality- 
wise, complications appear. For example, Lucas found that when anxiety 
was varied experimentally by varying the number of failure experiences, one 
class of subjects showed a decrement in performance with increasing anxiety 
while another class of subjects showed an increment. The interpretation 
which seems to be called for, and which Lucas was not able to supply within 
the simplified conceptual framework he was using, is that different classes of 
subjects (such as those high and low on the Taylor anxiety scale) have dif- 
ferent response tendencies evoked by situationally produced anxiety. This is} 
the second reason that the Iowa studies have missed this essential line of 
interpretation—that despite their origin in a framework of learning theory’ 
they have paid no attention to possible differences among subjects in their 
previously learned response-tendencies to the cues provided by anxiety. It is 
in connection with these two points that the Iowa studies are valuably sup- | 
plemented by the studies now to be reviewed which have used other personal- | 
ity variables. 

Scale of anxiety in the testing situation—Mandler & Sarason (31) have 
developed a questionnaire which, like the Taylor questionnaire, is intended! 
to measure anxiety as a personality variable. A more specific measure is at-\ 
tempted here, however; it is a measure not of general anxiety but of anxiety} 
in the situation of being tested. The hypotheses that have been tested with ! 
the Taylor questionnaire all have to do, of course, with effects of anxiety as 
aroused in the situation in which performance is measured. To the extent, 
then, that individuals differ with respect to anxiety arousal by different kinds 
of situations, and that questionnaires can really discriminate between anx- 
iety in general and anxiety in one class of situation, the Mandler-Sarason 
questionnaire should be an even more appropriate and precise instrument for 
verifying hypotheses about the effects of anxiety when the hypotheses pre- 
dict performance in a situation in which the subject is being tested. This 
questionnaire asks the subject not about anxiety in general or in a variety of! 
situations, but only about anxiety in connection with being tested (in course 
examinations, in individual intelligence testing, and in group intelligence; 
testing). 

Mandler & Sarason (31) offer one item of direct evidence for the validity 
of their questionnaire. The examiner, testing subjects without any knowledge 
of their questionnaire responses, made ratings of overt manifestations of 
anxiety in the testing situation. These ratings yielded a point correlation of 
.59 with anxiety tendency as measured by the questionnaire. Even though 
the subjects were from the extremes of the distribution of questionnaire 
scores, this seems to the reviewer to be impressive direct evidence of validity. 

This measure of test anxiety has been studied in relation to performance 
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on several kinds of tasks. All of these tasks are ones of high complexity, and 
attention was not focussed on task differences. In comparison with the Iowa 
studies, these studies have the special value of presenting evidence on the 
interaction of personality differences not with task variables but with a set of 
experimental variables which might all be regarded as falling in the general 
class of experimental manipulation of anxiety arousal. Review of these stud- 
ies here will be concentrated on this problem; as there was no such similar 
concentration of attention on this one problem in the original studies, how- 
ever, it will be noted that some of the statistical tests relevant to this problem 
were not made in the published articles and have to be roughly estimated for 
present purposes. 

In a series of three experiments Mandler & Sarason (partly in collabora- 
tion with Craighill) have explored the ways in which the dependence of test 
performance upon experimentally aroused anxiety varies according to status 
of the subject on the personality variable of test anxiety. Throughout these 
studies the general procedure has been to use subjects who are preselected 
for having either extremely high or extremely low scores on the test anxiety 
questionnaire, and then to subject each of the two extreme groups to identi- 
cal variations of experimental procedure. 

In the first of these studies (31), performance was measured on the Kohs 
block-design test, and the experimental variable consisted of failure, success, 
and neutral stimulation. This variable was introduced through statements 
of the examiner, informing the subject either that he had done very poorly 
on the preceding tests, or that he had done extremely well (accompanied by 
a challenge to continue doing as well), or else making no reference to stand- 
ards or to previous performance. The effect of failure versus neutral stimu- 
lation is the finding most closely parallel to that of other studies. The tend- 
ency was for failure stimulation (in comparison with neutral) to produce a 
decrement in performance in the high anxiety group, but to produce an in- 
crement in performance in the low anxiety group. Only the second of these 
two effects reaches ordinary standards of statistical significance; no test is 
made of the difference between the two groups in the effect, but it seems prob- 
able that it would be significant. The interpretation offered has to do, of 
course, with personality differences between the two groups. The high anx- 
iety group, it is suggested, have a strong previously acquired tendency to 
react to anxiety, in a test situation, with responses not relevant to the task, 
and the reference by the examiner to the test situation and to failure in it 
serves to elicit such responses more strongly than before and thus to inter- 
fere with adequate performance. The low anxiety group, on the other hand, 
have little tendency to make task-irrelevant responses; when some degree of 
anxiety is evoked by the mention of failure, it is then supposed, anxiety 
contributes to the total drive strength motivating task-relevant responses 
and thus leads to improved performance. 

A subsidiary finding of great importance because of its novelty, even 
though it is not statistically significant, has to do with the effects of success 
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stimulation. A tendency appears for success stimulation to operate in essen- 
tially the same way as failure stimulation; that is, for success stimulation 
(in comparison with neutral stimulation) to depress performance in the high | 
anxiety group and to improve performance in the low anxiety group. The | 
interpretation offered is that reference to performance in the test situation 
in connection with success tends to elicit internal responses in about the same 
way as in connection with failure, differentially for the high and low anxiety 
groups. If this finding is further confirmed, of course, it has important meth- 
odological implications, for it suggests that attempts to control failure 
stimulation by success stimulation do not provide an appropriate control, 
that in certain respects these two types of stimulation have identical effects 
which can be demonstrated only by having some clearly neutral kind of 
stimulation as a control. In addition, of course, this finding would contribute 
to clarifying just what it is about frustration or stress which influences per- 
formance, by indicating that it is something which is shared by frustration 
and by certain kinds of clear success, the presence of stimulation which elicits 
anxiety about evaluation of future performance in relation to goals and stand- 
ards of performance. 

In a second experiment (36), the experimental variable of anxiety arousal 
consisted of another kind of variation in statement to the subject. A digit 
symbol test was used, of sufficient length that no one could finish any trial 
in the time allowed. In one condition, subjects were told that they should be 
able to finish in the time allowed; in the other condition, they were told that 
they were not expected to finish. In the low anxiety group, the expect-to- 
finish instructions produced very markedly better performance than the 
other instructions. The interpretation made is similar to that offered in the 
first experiment; the expect-to-finish instructions led to increased drive 
which, in these subjects, served to motivate task-relevant responses. The in- 
structions might be thought of as frustrating in character (and similar in- 
structions have been used by other experimenters with the intention of pro- 
ducing frustration), and it is of interest, in this connection, to note that the 
effect was apparently just about as large on the first trial, when the subjects 
had had no opportunity to discover that they would in fact not be able to 
finish, as on subsequent trials which had been preceded by this frustrating 
discovery. Again, then, there is a suggestion here that some of the effects of 
frustration on performance are indirect results of direct effects which may 
also be produced in other ways than by frustration. 

In the high anxiety group, an opposite effect of this instruction variable 
was predicted, on the grounds that such subjects would predominantly react 
to the statement that they should finish with task-irrelevant responses. 
Such a reversal of effect was found on two of five trials, but not on the other 
three. There certainly does, however, appear to be a clear difference in effect 
between the two anxiety groups which is probably statistically significant, 
though it was not tested. 

In a third experiment (36), the experimental variable was ego-involve- 
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ment as manipulated through instructions. As in many previous experiments 
on ego-involvement, half the subjects were told that the purpose of the test- 
ing was individual evaluation while the other half were told that the experi- 
menter’s interest was in studying the task. The task used was a stylus maze. 
Rather consistently, ego-involvement favored better performance in the 
low anxiety group, and poorer performance in the high-anxiety group, though 
most of the differences which the authors tested were not statistically signi- 
ficant. 

In the theoretical argument presented in their first paper (31), Mandler 
& Sarason also arrived at certain predictions about absolute differences in 
performance between high anxiety and low anxiety subjects, which derived 
from essentially the same theoretical notions which have been referred to 
above in describing the experimental aspects of their work. Of necessity, 
these predictions cannot be as definitively tested because of the possible role 
of ability differences as sources of, rather than results of, anxiety level. In all 
three articles (31, 35, 36) published by this group, however, there are results 
which on the whole are consistent with predictions of this sort which they 
make. Their theoretical statement also includes a prediction that high anxiety 
subjects should under favorable conditions of repeated and rewarded practice 
learn to respond to their anxiety with appropriate task-relevant responses, 
and thus have a motivational resource for improvement through practice 
which exceeds the parallel resources of the low anxiety subjects. The item of 
evidence which to the reviewer seems most striking as a confirmation of this 
suggestion is a nonexperimental finding reported in their third article (35). It 
was found that scores on aptitude tests taken upon college entrance have a 
definitely lower average in high anxiety subjects than in low anxiety sub- 
jects, whereas the relation between the two groups in college grade average 
is slightly reversed. No test was made of the significance of this difference in 
relationship, but it appears probable that it would be significant. The inter- 
pretation made by the authors is that in progress through a college course 
the high anxiety person has the opportunity to learn to respond to his anx- 
iety with task-relevant responses, so that the disadvantage he has in novel 
anxiety-arousing situations (such as taking intelligence tests) disappears or 
may even in time be turned into a motivational advantage. 

These various studies done with the Mandler-Sarason scale of test anxiety 
seem to justify a definite conclusion. It is that the subjects with high test 
anxiety differ from the subjects with low test anxiety in the habits of re- 
sponding to anxiety which have been built up through the course of their 
life. High anxiety subjects evidently have habits of responding to anxiety 
with various responses, ifiternal and external, which are incompatible with 
efficient pursuit of a complex task; hence these subjects do worse in a situa- 
tion which evokes much anxiety (through ego-involving instructions, an- 
nouncement of failure, etc.) than in a situation which does not evoke much 
anxiety. Low anxiety subjects, on the other hand, evidently lack strong hab- 
its of responding to anxiety with task-irrelevant responses, so that anxiety- 
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arousing instructions have as their main effect on these subjects an improve- 
ment of performance through increase in drive. (The increase in drive in an 
anxiety-arousing situation must of course be supposed to occur in the high 
anxiety subjects also, but its direct effect in improving performance to be 
obscured through interference by the task-irrelevant responses made to it.) 
It seems to the reviewer that these facts about the interaction of anxiety as 
a personality variable and as an experimental variable rather conclusively 
eliminate the simple application by Taylor & Spence (44) of the response- 
hierarchy concept as the main explanation of performance decrement with 
high anxiety, though their explanation may indeed be essential and adequate 
for certain simple instances. For the more general problem, it would appear 
that the preexperimental acquisition of response tendencies to anxiety as a 
cue must be taken into account. Here perhaps are the ‘“‘new theoretical as- 
sumptions regarding the properties of anxiety and failure’ which Lucas- 
(30, p. 65) suggests are called for in order to explain those findings of his 
which parallel those of the Mandler-Sarason studies, though he suggests in- 
stead a different set of assumptions. He suggests, in effect, that as the level 
of anxiety increases (either as a function of personality or as a function of 
experimental manipulation), favorable effect on task-relevant responses in- 
creases first more rapidly, and then less rapidly, than the effect on interfering 
responses. The reviewer would agree that such an assumption may be neces- 
sary, particularly to explain why high anxiety subjects sometimes perform 
better than low anxiety subjects, even on complex tasks, in situations that 
are relatively nonanxiety-arousing. But if one searches for a theoretical back- 
ground for this otherwise ad hoc assumption, it seems likely to be found in 
knowledge of the circumstances under which irrelevant response-tendencies 
have been learned by various persons, and hence of the cuing off of those 
responses by different levels of anxiety in different persons. In that case, one 
comes back to the necessity for attention to individual differences in the out- 
come of previous learning as an essential part of an adequate explanation. 

Scale of tendency to make interfering responses to frustration.—The third 
personality variable which is relevant to this topic is one which has been used 
only in two experiments reported in a single paper by Waterhouse & Child 
(45). It is a measure derived from a questionnaire intended to sample the 
individual’s characteristic responses to frustration, with respect to strength 
and pervasiveness of tendency to react to frustration with responses which 
would tend to interfere with adequate performance of a task. In the principal 
experiment, a variety of intellectual and motor tasks were employed, and 
frustration was introduced experimentally by informing half the subjects 
that they were doing very poorly. Subjects above the median on the inter- 
fering response questionnaire performed worse under frustration than under 
neutral conditions, whereas an opposite effect was found in subjects low on 
the interfering response questionnaire, and this interaction was statistically 
significant. In a subsidiary experiment this finding was confirmed, though not 
quite at the 5 per cent confidence level, with different tasks, subjects, and 
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mode of inducing frustration. This finding is now so familiar from the Lucas 
and the Mandler-Sarason studies that there is no need to elaborate it or its 
interpretation here. The novelty in this study is that an attempt was made 
to measure directly a tendency to make interfering responses, whereas this 
tendency was in the other studies only introduced hypothetically as an ex- 
planatory device. In the other studies it was assumed that strength of anxety 
proneness as a personality variable could be taken as an index of tendency to 
make interfering responses to the cues provided by anxiety. The fact that 
the Waterhouse-Child scale of interference tendency shows a correlation 
only in the neighborhood of .50 with the Mandler-Sarason scale of test 
anxiety (35) may possibly justify an assumption that the two variables have 
a degree of independence sufficient to warrant their joint use. Particularly if 
measures of anxiety proneness and interference tendency could be made 
purer and more independent of each other, joint use of the two measures 
might be useful in seeking further disentanglement of the two opposing ef- 
fects of anxiety on performance. 

Miscellaneous studies of psychodynamics and performance.—In addition 
to the studies already reviewed, which have used personality measures es- 
pecially devised for theoretical relevance to this problem, several other re- 
cent studies have been concerned with the relation of personality character- 
istics to psychodynamic influences on performance, measuring personality 
characteristics with standard tests. One test used in this connection is the 
| Rorschach. Several years ago Williams (48) reported a very high relationship 
between certain Rorschach variables and performance changes under ex- 
perimentally induced stress. Eriksen, Lazarus & Strange (21), in a somewhat 
comparable experiment, have failed to find any significant evidence of rela- 
tionships between Rorschach variables and performance changes under stress. 
This outcome, and the similar outcome of several other comparable studies 
as reported or summarized by Carlson & Lazarus (8), lead the reviewer to 
suppose that Williams’ findings may have been a product of sampling error. 
For lack of repetition of experiments, there are probably many statistically 
significant conclusions which are widely accepted despite being a product of 
only sampling error, and this recent work provides an instructive example 
of the value of repetition or near-repetition of psychological studies. 

Another test used for a similar purpose is the Rosenzweig Picture-Frus- 
tration Test. Hybl & Stagner (28) studied reaction to stress on three fairly 
complex tasks, and administered the P-F Test to their subjects. Two groups 
of subjects selected as showing maximum or minimum disruption under 
stress showed several significant differences on the P-F Test. For example, 
those most affected by stress show significantly higher scores on intropunitive 
and lower scores on impunitive, tendency than do those least affected by 
stress. Perhaps the results may be viewed as tending to confirm the idea that 
disruption of performance under stress is associated with a general tendency 
to react to frustration with disrupting response tendencies, but the reviewer 
is not well enough acquainted with the P-F Test to judge with any confidence. 
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Interpretation is complicated by the fact that the high and low groups here 
differ with respect to psychiatric diagnosis as well as stress reaction. To the 
thesis that response to stress is a function of general personality character- 
istics, the Hybl & Stagner study also contributes by finding very high corre- 
lations among the various complex tasks used, with respect to the effect of 
stress upon performance. 

Certain other studies published during the past year, while not concerned 
with personality variables, fit in well with some of the findings of the research 
already reviewed. Thus the point that the effects of anxiety vary with dif- 
ferent kinds of tasks is reinforced by a finding by Moldawsky & Moldawsky 
(32), that in a group of college students performance on a digit span test was 
disrupted by an anxiety-inducing procedure, where performance on a vocabu- 
lary test was not. The important role of interfering responses evoked by a 
frustrating situation is stressed by Child & Waterhouse in two theoretical 
articles (13, 14). Particularly pertinent to the present discussion is their 
critique (13) of the Barker, Dembo & Lewin (2) experiment on regression. 
The broad generalization often drawn from the Barker, Dembo & Lewin) 
study, that frustration tends to produce regression (in the sense of primitiva-| 
tion or disorganization of behavior), can hardly go unchallenged today.’ 
Child & Waterhouse attempt to show, partly through reanalysis of the data‘ 
published by Barker, Dembo & Lewin, that that experiment demonstrates 
primarily that the extent to which regression (in this sense) results from frus- 
tration is a function of the extent to which the specific occurrence of frustra- 
tion gives rise to response tendencies incompatible with high level perform- 
ance in the behavior which is being observed. 


AGGRESSION 


Scientific evidence on the determinants of aggression as a reaction to 
frustration has received a notable addition in the demonstration by Pastore 
(34) of the importance of arbitrariness of the frustrating situation as an in- 
fluential variable. The evidence was derived from a study based on essen- 
tially the technique used by Doob & Sears (18), in which subjects are asked 
to indicate their probable reactions to a number of frustrating situations. 
Pastore prepared descriptions of situations in pairs (to be presented to dif- 
ferent subjects), distinguished by whether the social origin of the frustration 
lay in an arbitrary act of another person or in a more justified act. Aggressive 
responses were much more numerous for the more arbitrary set of situations. 
The theoretical significance of this finding is certainly not clear, as Pastore 
recognizes in discussing various interpretations that might be made. But the 
clear demonstration of the importance of this variable should be a useful 
stimulant for further research on this problem which has been so oddly neg- 
lected in psychological research. 

One way in which the relationship between arbitrariness and aggressive 
reaction might arise, of course, is through characteristic sequences of learn- 
ing in the life history of individuals in our society, or perhaps in any society. 
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While there has been a frequent general assumption that learning is respon- 
sible for the frustration-aggression sequence, or at least greatly modifies it, 
and Whiting (47) has argued for this view from ethnographic evidence, psy- 
chologists seem not to have begun a direct test of this hypothesis until the 
( recent work of Davitz (15). Davitz gave groups of children in a summer camp 
repeated training, either in aggressive games or in constructive activities, in 
a specific playroom. He subsequently subjected them to a frustrating expe- 
rience and immediately introduced them into the same playroom again. Their 
behavior during this postfrustration period was then studied in relation to 
which of the two kinds of training they had undergone. It was found that 
the aggressively trained subjects behaved more aggressively and less con- 
structively than did the constructively trained subjects. The highly specific 
and similar nature of the two situations in this experiment (that of training, 
and that of subsequent observation of behavior) of course limits the breadth 
of generalization that one can feel to be immediately justified, as the author 
recognizes. But for the situation used, there is very clear evidence of the 
important role of previous learning in influencing the overt response to frus- 
tration. 

While Davitz’s experiment has been introduced here in connection with 
the topic of aggression, it also has a very suggestive bearing upon the pre- 
vious topic of psychodynamics and performance. A slightly different exper- 
iment might well demonstrate that previous training in aggression (or in 
some other task-irrelevant responses), in comparison with no such training, 
tends to lower quality of performance in a task because of producing inter- 
fering responses. In the studies cited in the preceding section, where explan- 
atory use was made of the previous learning by subjects of responses to frus- 
tration, no evidence was actually available about that learning; it was 
merely inferred from present characteristics of subjects, and constitutional 
differences could perfectly well have been the major source of those character- 
istics so far as present evidence could show. 


IDENTIFICATION 


Several studies have attempted to verify hypotheses associated with the 
concept of identification. All of these have employed a cross-sectional method 
obtaining from subjects some index of identification and at the same time a 
measure of other characteristics which might be associated with degree of 
identification. Sopchak (38) measured identification by the degree of similar- 
ity between a subject’s own responses on the MMPI? and the responses which 
he believed his father (or, separately, his mother) would be likely to give. 
For male subjects, Sopchak reports a negative relationship between degree of 
identification and all nine maladjustment measures on the MMPI, with some 
slight evidence of a closer relationship for identification with father rather 
than mother. One might wonder whether this finding results merely from a 
uniform tendency for children to think of their parents as giving ‘‘normal” 
responses; some of the other possible criticisms, inevitable in the light of the 


XUM 


PERSONALITY 163 


methods used, are discussed by Sopchak. For a much smaller group of female 
subjects, the results were more complicated; for identification with father, 
the results with males were confirmed, but for identification with mother 
consistent though not significant results suggest a relation in the opposite 
direction. For both sexes, the relation to general identification score appears 
to be greater for the ‘‘psychotic” variables than for the ‘‘neurotic”’ variables. 

Cass (9, 10) employed a more varied questionnaire in attempting to meas- 
ure identification, a questionnaire concerned with a variety of personal pref- 
erences. Her index of identification was the degree of agreement between 
the child’s own responses and the mother’s own responses. She tests several 
relationships between this index and other variables. The one most pertinent, 
for comparison with Sopchak’s results is delinquency; she finds no evidence 
of a relationship between delinquency and identification scores (10). Pos- 
sibly this is because, as Sopchak argues, the child can only identify with the 
image he has of his parent, not with the parent as he actually is; on the other 
hand, Sopchak’s technique has the difficulty that it may only be measuring 
projection. In this field where measurement offers such problems, it may be 
useful in further studies of identification to include both indices. Results 
confirmed through both indices will have greater plausibility than those con- 
firmed through only one of the two fallible indices. 

Meanwhile, there might well be wonder whether any such techniques as 
those used by Cass and Sopchak penetrate at all to the largely unconscious 
processes to which the term identification has been applied by clinicians. 
Evidence supplied by Cava & Raush (12) is very encouraging on this point. 
They obtained from boys in the twelfth grade a measure parallel in nature 
to that of Sopchak, but based on the Strong Vocational Interest Blank. The 
subjects responded to the Blank for themselves, and as they guessed their 
fathers would respond. The measure of identification consisted of the number 
of items on which these two responses were the same. The same subjects 
were also given the Blacky Test [see Blum (5)], in which stories are told in 
response to pictures of dogs in situations relevant to important sources of 
human conflict. 


The following dimensions of the test were chosen for investigation because of 
their theoretical relationship to the identification process: (2) Oedipal Intensity, (6) 
Castration Anxiety, (c) Identification, and (d) Ego Ideal. A fifth index representing 
an over-all estimate of conflict in these four areas was also used (12, p. 855). 


Scores for disturbance in each of these areas were found, as predicted,to 
be negatively related to the measure of identification (or of perceived simi- 
larity, as the authors call it); for one of the areas, and for the over-all dis- 
turbance score, the relationship was statistically significant. 


PERSONALITY STRUCTURE 


Trait concepts associated with psychoanalytic theory of psychosexual develop- 
ment.—Psychoanalytic theory of psychosexual development suggests, though \ 
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without necessarily implying, a number of predictions about intercorrela- 
tions among behavior tendencies. For example, if the theory interprets sev- 
eral personal characteristics as an outcome of anal fixation, it is possible to 
suppose that these characteristics should be correlated with each other be- 
cause of their common origin. Various empirical correlations have received 
special attention in the past because they tend to confirm or to nullify such 
predictions. Surprisingly, however, it is only recently that comprehensive 
studies directed at testing a variety of such predictions have begun to appear. 

In a way, the most comprehensive of these studies is that by Barnes (3), 
which is concerned with a variety of interrelationships predicted from the 
notions of oral, anal, and phallic levels of development. From the psycho- 
analytic literature, Barnes derived a number of trait adjectives associated 
with each of these developmental levels. The status of subjects on each of 
these traits was then measured by their responses to five or ten question- 
naire items, involving either self-rating or expression of attitudes. Such trait 
scores were obtained for a large sample of male college students and inter- 
correlated, and a factor analysis was performed. There did not emerge any 
set of three factors corresponding to the three psychosexual levels. This is a 
valuable fact to have established, however skeptical one might be in advance 
about the suitability of the kind of data employed. The value of Barnes’ 
study does not stop with this general negative finding, however. He reports 
some partial groupings of traits which are in accord with certain of the pre- 
dictions based on psychoanalytic concepts, and these positive findings too 
will need to be taken into account in reaching, in the future, some satisfac- 
tory evaluation of the psychoanalytic trait concepts. 

Another more specific study of similar nature has been published by Blum 
& Miller (7). It is concerned only with correlations predicted from the con- 
/cept of oral character. While the authors say they are using ‘‘conventional 
psychological methods,” their specific choice of methods is highly ingenious 
and well adapted to the problem. Studying third-grade children, they used 
as a basic measure of orality the frequency of nonpurposive mouth move- 
ments. They then tested the relation of this measure toa variety of behavioral 
observations, sociometric measures, and teachers’ ratings which they felt 
might be symptomatic of oral character traits. There was sufficient confirma- 
tion of predictions to encourage further research which Blum (6) indicates is 
in progress. A major difficulty with the Blum & Miller study as so far re- 
ported is that the results seem to lend themselves to explanation in terms of 
a more general variable of nervousness or maladjustment, instead of requir- 
ing interpretation by reference to a specifically oral variable. 

These two studies represent important additions to a growing body of 
data available for empirical evaluation of these trait concepts which have 
been derived from psychoanalytic theory. The earlier studies have been 
most recently reviewed by Blum (6). Inconclusive but suggestive, must be 
the verdict on all this work to date. Each study is subject to one or more 
criticisms which prevent it from leading to strong conviction either way 
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about the value of these trait concepts. While the defects are various, one 
deficiency of all these studies [with the single exception of that by Goldman- 
Eisler (25)] is that they do not combine the structural study with a study of 
antecedent variables which might eventually permit the nailing down of 
appropriate labels for those clusterings which are consistently confirmed in 
the structural analysis. 

Trait concepts associated with psychodiagnostic categories—No attempt 
will be made here to provide a general review of research relating to abnormal 
behavior. An exception will be made, however, in the case of certain work 
by Eysenck which seems to have special methodological significance for 
other problems of personality study. 

The contribution of special significance here is to be found in Eysenck’s, 
(23) use of what he terms ‘‘criterion analysis.’”’ The data for such an analysis 
consist of scores on a number of variables for individuals in two groups 
which differ in some way believed to be relevant to those scores, e.g., scores 
of normal and neurotic subjects on tests believed to be sensitive to neuroti- 
cism. Independently for each group, a factor analysis is performed. Rotation 
for the first factor is then done in such a way as to maximize the correlation 
between the factor loading of the variables and the accuracy with which the 
variables discriminate between the two groups; again, this is done independ- 
ently for each group, except of course that the criterion of discrimination is 
the same for both rotations. The use of such a criterion as a basis for rotation 
seems to justify a high degree of confidence that the underlying variable 
within each group identified by the factor is the same variable on which the 
two groups are distinguished. Inspection of the factor loadings, then, may 
give more accurate information than was available before about the precise 
nature of the variable. In just this way, Eysenck offers a contribution to the 
measurement and to the understanding of two dimensions, a neurotic dimen- 
sion and a psychotic dimension. 

This method seems capable of fruitful application in a number of ways. 
True, criterion groups are not available for many of the dimensions which 
the student of personality wishes to explore. But important criterion groups 
may be found for many dimensions when they are looked for. In particular, 
it would seem that this method might add to the value of studies exploring 
the relation of a variety of supposedly consequent variables to a single ante- 
cedent variable, where roughly identifiable position on the antecedent vari- 
able provides the basis for segregation of subjects into two groups. 

That the technique of criterion analysis is no cure-all for the problems - 
of identifying and interpreting basic dimensions of variation in personality, 
and that just as much care as ever is needed in planning a study which will 
use it, is well illustrated by one point made by Eysenck in a rather different 
connection. In a study which was exploring both the psychotic dimension 
and a possible schizothymic-cyclothymic dimension (for which the criterion 
groups were patients diagnosed as schizophrenic or manic depressive), 
Eysenck (22) reports that he found no evidence for the latter dimension. He 
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shows quite clearly, however, that this is because the psychotic dimension 
had already been isolated in the same study. Criterion analysis applied to 
the schizophrenics and manic depressives alone yielded a factor which was 
nicely correlated with the group difference. It was only because a group of 
normals was simultaneously being studied, that this factor was seen to be 
identical with that distinguishing normals from the psychotic group as a 
whole, and not a distinctive schizothymic-cyclothymic factor. 

Search for basic traits through factor analysis.—Several articles during the 
past year have reported results of factor analyses of personality measures. 
In the opinion of the reviewer, the significance of such results is so completely 
dependent upon systematic comparison of findings of diverse studies that 
there is little point in reviewing separately all the specific contributions in 
any one year. One of the recent studies, however, is of such distinctive inter- 
est as to warrant separate attention here. This is the study by Cattell & 
Cross (11), which reports the results of a factor analysis of intraindividual 
variation on motivational variables, and compares them with the results of 
earlier analyses of interindividual variation on the same or similar variables. 

Cattell & Cross dealt with 20 statements expressing specific motivational 
interests, statements which for the most part had shown high loadings on nine 
factors identified in two interindividual studies. Responses of a subject to 
these statements were obtained by three different techniques (scores ob- 
tained by the three techniques then being pooled), twice a day for 40 days. 
Intercorrelations among the 20 variables, with respect to session-to-session 
variation, were then calculated and factor-analyzed. 

Perhaps of greatest interest is Cattell & Cross’s finding that most of the 
factors emerging from this analysis could reasonably be identified with 
factors previously emerging from interindividual correlations. Obviously 
it would be premature to attempt to draw any conclusions, from this one an- 
alysis, about which motivational factors will be found dependably to show 
this consistency for interindividual and intraindividual variation, and which 
factors will not. But further study of this sort may have important implica- 
tions, or at least pose important problems, for theoretical interpretation. 
Among the various hypotheses that may be used in explaining interindivid- 
ual variation in personality, some should lead to a prediction of a cor- 
responding temporal variation within the individual and some should not. 
At one extreme, the hypothesis that a particular trait is primarily a function 
of some definite physicochemical condition in which an individual varies 
from time to time, but for which each individual’s variation is around a 
mean level characteristic for him, would suggest that manifestations of 
the trait might well vary in much the same way as between individuals and 
within an individual from time to time. At another extreme, the hypothesis 
that certain items of behavior vary together among individuals because they 
covary in subcultures and each individual has learned the pattern of his 
particular subculture, would not necessarily suggest any corresponding co- 
variation within the individual from time to time. The careful comparison 
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of inter- and intraindividual variation might be an important tool in dis- 
tinguishing among various traits with respect to the nature of their origin. 

A second point of interest in Cattell & Cross’s paper is that it provides 
a new and instructive illustration of how controlled techniques and statistical 
manipulations ordinarily used for the verification of general hypotheses can 
be applied to the understanding of specific cases. The reviewer is somewhat 
skeptical about the efficiency of P-technique as an ordinary clinical instru- 
ment, but it can obviously yield results of great interest where there is justi- 
fication for the necessary expenditure of time. 

Mode of ego defense——In discussing selective recall of failures and suc- 
cesses, several psychologists have suggested that individual differences in this 
measure reflect a general tendency to react to ego threat in some specific 
manner. Direct evidence that this is so, however, has been mostly confined 
thus far to Alper’s very suggestive findings (1) which, because they are based 
on so few cases, cannot be very conclusive. Now Eriksen (19, 20) has pro- 
vided some more substantial evidence. His evidence is limited to the specific 
point that there is some generality of mode of ego-defense. In one study (19), 
he shows that selective recall of successes rather than failures is associated 
with manifestations of perceptual defense in another experimental situation. 
In another study (20) he shows that selective recall of successes rather than 
failures is associated with difficulties in the rote learning of materials which 
seem to be emotionally disturbing to the subject. While Eriksen does not in 
these papers attempt to push back to the more general personality differences 
which may lie behind these consistencies, he establishes the existence of a 
consistency much more securely than before. 
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SOCIAL PSYCHOLOGY AND GROUP PROCESSES!* 


By RICHARD S. CRUTCHFIELD 


Department of Psychology and Institute of Personality Assessment and 
Research, University of California, Berkeley, California 


The field of social psychology continues in healthy ferment. Over-all the 
impression is one of vigor and freshness of ideas, balanced by a growing 
sophistication and maturity of theory, problems, and methods. 

Significant trends in social psychology over recent years persist in current} 
work and become even more firmly established. Interdisciplinary relations | 
with sociology, anthropology, and other social sciences, are broadened. The} 
inherent unity of social psychology with the area of personality becomes more } 
pronounced. Field research takes its proper place alongside laboratory re-! 
search, and the latter shakes off some of its earlier narrowness and artifi- 
ciality. Rigorous conceptualization and model-construction abound. Ling 
frontier field of group dynamics shows perhaps the greatest upsurge of all,} 
with its convincing demonstrations of how crucial psychological variables, 
can be dealt with experimentally in genuine group settings. Appropriate to! 
its central psychological significance, the study of social perception makes 
progress, though all too haltingly. Attitude change research has been rescued 
from the doldrums by its present emphasis upon explicit conceptualization, 
careful design, and attachment to larger theory. And subsidiary to all these 
areas, methodological and technical advances more than keep pace. 

In contrast to these trends it is easy to detect a favorable decline in the 
kind of unconnected, socially irrelevant, and atheoretical studies that have 
traditionally abounded in the field of social psychology. 

One trend not manifested as emphatically as might be desired is that 
toward insightful social psychological theorizing which, distinct from the 
more formal and systematic hypothetico-deductive type of which there is 
a considerable amount, is theorizing in its most fundamental sense, namely, 
the creative conception of psychological relationships which must occur 
before the more formal research and theory construction can proceed. 

All in all, the year’s work has been largely in the nature of a consolidation 
of the above trends and of laying of groundwork for things to come. It can 
not properly be called a vintage year, in view of the relative scarcity of 
products having genuine brilliance, flavor, and richness. But that there are 


noteworthy exceptions to this becomes abundantly clear in the review which 
follows. 


1 The survey of literature to which this review pertains was completed in June, 
1953. Mrs. Britomar Handlon and Mr. William T. Richardson have been invaluable 
in collecting material for the review. 

2 The following abbreviation is used in this review: MMPI (Minnesota Multi- 
phasic Personality Inventory). 
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SYSTEMATIC TREATMENT OF THE FIELD oF SOCIAL PSYCHOLOGY 


A penetrating treatise on fundamental problems of social psychology 
has been written by Asch (6). Though cast superficially in the form of a 
textbook, it is directed at a more profound level than is true of the typical 
text. Its intent is to bring into sharper focus and to seek theoretical and em- 
pirical clarification of the underlying psychological assumptions of social 
psychology. It is not offered as a comprehensive ‘‘system’”’ of social psycho- 
logical theory, but rather as a connected series of discussions of the basic 
nature of social phenomena. 

The orientation is that of Gestalt theory, the fruitfulness of which Asch 
wishes to demonstrate for social psychology. But a reader unsympathetic to 
the Gestalt view need not take alarm; the discussion is valuable for its 
sensitive isolation and analysis of critical theoretical issues confronting the 
social psychologist, no matter what his systematic persuasion may be. 

As might be expected of the Gestalt orientation, heavy stress is laid on 
the necessity for phenomenological description of the structural properties 
of experience and action. The keynote is the role of insight and understand- 
ing in the processes of human interaction. The superiority of concepts stress- 
ing sensible and insightful cognitive relations over arbitrary (‘‘blind’’) con- 
nections is argued at every opportunity, especially in reference to social 
perception, social suggestion, and attitude change. These topics are richly 
illustrated from Asch’s own pioneering experimental research on how 
people perceive others, how individual judgment is distorted by group 
pressure, etc. 

Considerable attention is given to analysis of aspects of social phenomena 
that many social psychologists have been reluctant to explore, such as the 
psychological basis of group interaction, the ‘‘group mind’’ concept, social 
value, and cultural relativism. The flavor of Asch’s thinking is well exempli- 
fied by his treatment of ‘‘doctrines of man.” He disagrees sharply with the 
psychological assumptions and implications of several prevailing ‘‘doctrines,”’ 
especially, the ego-centered character of man, the supremacy of irrational 
emotions, the primacy of rationalization in human thinking, the basis of hu- 
man experience in arbitrary association and conditioning, and the roots of 
adult attitudes in childhood experiences. That he exaggerates the case against 
these doctrines, doing less than justice to their positive contributions, is 
undeniable, but in this fashion he makes explicit the tacit assumptions in 
these doctrines and illuminates difficulties inherent in them. 

Along more conventional lines are textbooks by Sprott (118), on a fairly 
nontechnical level; by Britt (19), a revision of an earlier book; and by Bonner 
(17), characterized by a heavy emphasis upon ethnological data and cross- 
cultural comparisons of human behavior. 

Relationships of social psychology to other disciplines—Several writers 
address themselves to the question of the systematic status of social psy- 

\ chology in relation to other fields. Tolman (126) presents an analysis of the 
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relations between sociology and psychology by comparing the nature of} 


independent, intervening, and dependent variables in the two disciplines. 
Sociology, as well as anthropology, is concerned with such independent vari- 
ables as the ‘‘surrounds”’ (climate, geography, history, other groups, etc.), 
and special personality characteristics and psychological processes of mem- 
bers of the given group; with such intervening variables as social structures, 
ideologies, and roles; with such dependent variables as role performances 
and group productions. Psychology, on the other hand, is concerned with 
such independent variables as constitutional factors and physiological drive 
states in the individual, and physical and social stimulus factors affecting 
him; with such intervening variables as values, cognitions, and behavior- 
field forces; with such dependent variables as behavioral achievements or 
attempted achievements. With this schema of relevant variables for the two 
disciplines, Tolman seeks to clarify the types of theoretical and research 
problems that properly fall under such headings as sociological theory, 
psychological theory, social psychology, and psychological sociology. 

Lindesmith & Strauss (81) criticize attempts to extend the concepts and 
theories of the experimental psychology of learning (drawn heavily from 
animal psychology) to the problems of social psychology, which they regard 
as properly treated only at the human level. On the other hand, Smith & 
Ross (115) provide an informative review of the literature on social behavior 
of vertebrates, 1939 to 1950, with an argument for the importance of work 
in comparative social psychology. They stress the needs for extending social 
study of animals to more species and to wider settings, and for the guiding 
of such research by a more systematic conceptual framework. 

Bendix (11) points out shortcomings in the usual psychiatric conceptions 
of collective behavior which, in his opinion, give insufficient weight to the 
mores and folkways of a society as the critical determinants of behavior. 

How interdisciplinary planning for team research on social behavior in 
communities (the Springdale Project) may be made effective is discussed 
by Bronfenbrenner & Devereux (20). 


Group PROCESSES 


Models and methods in group study.—Research and theorizing on group 
processes are increasingly sophisticated in emphasis upon formal concep- 
tualization, mathematical model-construction, and rigorous experimental 
design. This is paralleled by rapid growth of techniques for the study of 
groups. 

One of the more impressive attempts in conceptual formalization is 
Simon’s (113) demonstration of “how the mathematization of a body of 
theory can help in the clarification of concepts, in the examination of the 
independence or non-dependence of postulates, and in the derivation of new 
propositions that suggest additional ways of subjecting a theory to empirical 
testing.”” Taking an example from Homans’ The Human Group (60), he sets 
up equations involving four group variables: intensity of interaction among 








174 CRUTCHFIELD 


members, level of friendliness among members, amount of activity carried 
on by members of the group, and amount of activity imposed on the group 
by the external environment. From these equations he derives the verbaliza- 
tions of Homans, plus certain others. The special importance of Simon’s 
study lies not so much in the particular deductions produced as in the dem- 
onstration of the effectiveness of a mathematical model in arriving at them 
by formal procedures. The weakness lies in the point, acknowledged by the 
author himself, that the equations must necessarily leave out many variables 
and that numerous simplifying assumptions must be made. As is true of 
most models in present-day social science, even the best, this one consists 
more in over-simplification than in idealization, which is the proper ultimate 
aim of a model. 

Stephan & Mishler (119) concern themselves with another aspect of the 
application of mathematical functions to study of groups. They work out a 
simple mathematical function that appears to express quite well the distribu- 
tion of participation within a particular type of small discussion-group. 
Thirty-six groups, varying in size from 4 to 12 members, were observed in 
a total of 81 meetings. An exponential function that accommodates the ob- 
served data on distribution of participation is: 


P; = ar! 


where P is the estimated percentage for subject ranked i, r is the ratio of 
percentages for adjacent ranks, and a is the estimate for subject ranked 1. 
(The ranking refers to the number of units of participation.) 

When three conditions are fulfilled—(a) that there is a range of verbal 
participation potential among the members, (b) that there is no systematic 
interference with free competitive expression of these potentials, and (c) 
that there is an absence of well differentiated roles among the members— 
then the size of the group is found to be an important parameter affecting 
the magnitude of the ratio in the basic equation. 

A three-dimensional model for visual analysis of group structure is 
described by Chapin (23). The three dimensions are (a) position (official), 
(b) distance (differences between individuals in scores of status scales, etc.), 
and (c) direction (toward upper status or lower status as expre:sed by 
incoming versus outgoing sociometric choices). Two main assumptions are 
involved in the model, first, that equality in incoming and outgoing socio- 
metric choices is a rough measure of intragroup reciprocity, and second, that 
the relative size of this intragroup reciprocity is one measure of the cohesion 
or solidarity of the whole group. 

A particularly promising mathematical model for the study of groups, 
as well as for wider problems in psychology and sociology, is the theory of 
graphs discussed by Harary & Norman (51). These two mathematicians 
provide an engagingly clear and concise outline showing the relevance of 


* This distinction between “‘over-simplification” and “idealization” in mathe- 
matical models in the social sciences has been discussed by Kaplan (67). 
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this discipline of mathematics for such problems of conceptualization as 
those to be found in group patterns and networks. Typical problems are 
those having to do with group structure, communication, stability and 
change, efficiency, etc. Comparison of graph theory with Lewin’s topological 
representation is made, and the authors introduce some new concepts into 
the theory of graphs with discussion of their psychological implications. 

In response to recent criticisms of the J-curve as an effective model for 
analysis of conformity behavior, Solomon (116) comments on the need for 
restricting use of the J-curve to the kinds of appropriate conformity situa- 
tions for which it was intended, on the soundness of the scaling methods 
used in developing the method, and on the impropriety of skewness and 
kurtosis as alternative measures of conformity, inasmuch as the J-curve 
assumes not an infinite number, but only four, principal variables: conform- 
ity producing agencies, simple chance as a leveling out factor, physiological 
or biological factors, and personality deviations. Argyle (5) discusses prob- 
lems of experimental design in the study of small social groups. 

Theory of groups——Horowitz & Perlmutter (61) give an extended dis- 
cussion of the concept of the social group. They argue for the propriety and 
necessity of development of laws of group behavior pertaining to group 
learning, group memory, group perception, and the like, as distinct from, 
and not merely reifications of, laws of individual behavior of perception, 
memory, and learning. They point to ambiguities and inconsistencies in the 
position of Krech & Crutchfield (71) on this matter, although acknowledging 
that those authors have argued for the essential independence of several 
“levels” of study of behavioral processes. 

A thoroughgoing treatment of the theory of group learning is provided 
by Cattell (22). Groups, he asserts, have three aspects: (a) population 
characteristics, which are the averages of characteristics of individual mem- 
bers; (b) structural characteristics; and (c) syntality characteristics, where 
syntality is to the group what personality is to the individual. Group 
learning is defined as ‘‘a change in syntality when the group is exposed to a 
certain environmental situation.’’ Of the three general types of individual 
learning, conditioning, insightful, and trial-and-error, only the latter is im- 
portant in group learning. The investigation of group learning may begim 
by exploration of the variables from classical learning, such as frequency, 
motivation, etc., but more interesting are the ‘internal dependent vari- 
ables,” e.g., size of group, length of association of its members, type of 
leadership, etc. Illustrative hypotheses relating to such group variables 
are, that large groups learn more slowly than small groups and that groups 
with a formally appointed leader learn more rapidly than leaderless groups. 
Cattell’s treatment is refreshing and enlightening, though on the side of 
conceptual development it may reflect too much reliance on principles of 
individual learning as it addresses itself to what is asserted to be a distinctly 
separate problem, that of group learning. 

An outline of group relationships designed to clarify and organize prob- 
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lems of research is presented by Landecker (74). He discusses four types of 
group integration: (a) cultural, the degree to which cultural standards are 
mutually consistent within the group; (b) normative, the degree to which 
conduct in the group conforms to its standards; (c) communicative, the de- 
gree to which members of the group are linked to one another by exchanges of 
meanings; and (d) functional, the degree to which they are linked to one 
another by exchanges of services. 

Determinants of group formation and structure-—One of the critical prob- 
lems in understanding group structure and behavior concerns what is con- 
tributed by the specific characteristics of individual members who consti- 
tute the group. Relatively little experimental work has yet been done on 
this problem. A significant study by Haythorn (55) seeks to evaluate the 
influence of individual members on small groups by systematically rotating 
the membership of the groups in.such a way as to isolate the influence of a 
given individual without regard for the makeup of any particular group. 
Sixteen male students of the Navy Reserve Officers’ Training Corps met in 
groups of four, working on a syllogistic reasoning task, a mechanical assem- 
bly task, and a discussion task. Each subject worked with each other subject 
only once, taking part in five different groups. Ratings were made of various 
characteristics of group behavior and of individual behavior in the group, 
such as aggressiveness, cooperativeness, efficiency, etc. Personality question- 
naire data were also obtained. In general, it is found that the behavior of 
individual members significantly affects the characteristics of small groups; 
for example, effective group functioning is facilitated by cooperativeness, 
efficiency, and insight, while behavior termed “‘striving for individual prom- 
inence’’ reduces group cohesiveness and friendliness. As to personality rela- 
tionships, it appears that mature, accepting persons facilitate the group 
activity, while suspicious, nonaccepting persons depress the smooth func- 
tioning of the group. The importance of this study is less in its verification 
of somewhat obvious relationships between group behavior and character- 
istics of members than in its demonstration that these two sets of variables 
can be effectively separated in an experimental setting. 

Gilchrist (45) has studied the formation of social groups under conditions 
of success and failure. After working individually at tasks in which failure 
and success were arbitrarily manipulated by the experimenter, and then 
announced, subjects were required to choose their preferred work partner in 
a further group task involving a pair. One experimental variable was grant- 
ing or not granting the choice. Another was the arbitrarily controlled success 
or failure of the formed group. Choices after the group experience were also 
determined. It appears that when such an unorganized collection of individ- 
uals is required to form groups, the individuals tend to form them around 
those others who have been successful. Failures do not tend to form groups, 
that is, by reciprocating one another's choices on the basis of mutual failure. 
The emergence of such failure groups is to be understood as determined by 
events occurring after the initial grouping choice has been made. The evi- 
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dence also points to the conclusion that knowledge of an individual’s level 
of aspiration about a task and of his expectations of success or failure with 
other individuals may lead to a reliable prediction of his choices of group 
partners, and hence, in a larger framework, to predictions about the emer- 
gence of specific groupings. The study also throws light on the comparative 
strength of expectations of success and failure with others before and after 
actual work experience with them. 

A highly significant study of group behavior by Festinger, Pepitone & 
Newcomb (41) deserves detailed review. Their concern is with consequences 
of what may be called deindividuation in a group, characterized in their own 
words as follows: 


There occurs sometimes in groups a state of affairs in which the individuals act 
as if they were “‘submerged in the group.’”’ Such a state of affairs may be described as 
one of de-individuation; that is, individuals are not seen or paid attention to as indi- 
viduals. The members do not feel that they stand out as individuals. Others are not 
singling a person out for attention nor is the person singling out others. 


The experimental design was directed toward study of two propositions: (a) 
that deindividuation in a group is accompanied by a reduction in inner re- 
straints for the members; (b) that groups characterized by such deindivjdua- 
tion and reduction of inner restraints are more attractive to their memibers 
To accomplish deindividuation in 23 groups of four to seven members each. 
conditions were set so as to encourage a reduction of inner restraints in dis- 
cussing feelings of hostility toward their parents. The degree of such reduc- 
tion in each group was measured by the relative predominance of negative 
over positive statements concerning parents. The degree of deindividuation 
was measured by the extent to which the group members were unable after 
the group discussion to identify which members had made certain statements. 

The first proposition is confirmed by a statistically significant correlation 
for the 23 groups between the above two measures, degree of reduction of 
inner restraints and degree of deindividuation. Four alternative theories to 
account for this obtained relationship are discussed and refuted: namely, 
repression, laws of memory, division of attention, and individual reactions 
other than the group phenomenon. The second proposition, saying that de- 
individuation is related to attractiveness of the group to its members, the 
latter being determined by the subject’s expression of desire to carry on fur- 
ther discussions with his group, is also confirmed. 

This study is noteworthy for its effective demonstration of how a long 
alleged but unverified group phenomenon of central theoretical impor- 
tance can be brought under precise measurement and conceptual exploration. 

Intragroup tensions—An effective combination of conceptualization, 
methodology, and social relevance is found in Herbst’s study of the family 
(58). Using a Lewinian approach, he distinguishes three concepts, the ‘‘family 
field,’ the ‘‘family structure,” and the ‘“‘tension balance.’’ The total family 
field can be constructed from an analysis of the ‘‘path field’’ of each member, 
which in turn derives from a study of his participation pattern in various 
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family activities. Herbst finds that within the population of Australian fami- 
lies he studies, the family field is virtually a cultural constant. Evidence for 
this is seen in the finding that given the number of regions in which a family 
member participates, it is possible to predict accurately which ones they are. 
On the other hand, family structure shows wide variations among families 
in his sample, characterized by four quite different patterns of husband-wife 
power-interaction: (a) husband dominant, (6) wife dominant, (c) autonomic, 
with separation of spheres of influence, (d) syncratic, with joint, co-operative 
determination. Each type of pattern has associated with it a different level 
of tension in the family. Herbst’s analysis permits him to draw a number of 
inferences concerning tension levels, balance, and change. For instance, al- 
though in specific areas of syncratic co-operative relationships tension may 
be high, the total family tension is low, thus typifying a generally efficient 
and harmonious family structure. Conversely, autocratic and autonomous 
relationships may achieve reduction of tension in connection with specific 
family areas at the serious cost of increased general family tension. 

In connection with other aspects of the problem of tensions in groups, 
Thibaut & Coules (125) make an experimental evaluation of the role of com- 
munication in the reduction of interpersonal hostility. Their study is directed 
at both the “‘catharsis’’ hypothesis [Dollard et al. (31)] and the “autistic 
hostility”’ hypothesis [Newcomb (95)]. The experiment involved groups con- 
sisting of two persons, one of whom was a confederate of the experimenter, 
the other being the real experimental subject. The two persons looking at 
one another through a window, wrote and exchanged messages revealing 
information about themselves in order that they could later write a person- 
ality sketch of the other. The confederate’s messages were standardized and 
designed to show him as opinionated and arrogant. The final one from him 
consisted of an outright attack on the other subject. Half of the experi- 
mental subjects were permitted to reply to this note and half were not. All 
then wrote new personality sketches of the other. Those allowed to reply to 
the final note showed fewer unfriendly elements in the personality sketch 
than did those not permitted to respond, thus supporting the hypothesis 
that ‘‘the communication of hostility to an instigator will tend to reduce the 
residual hostility toward that instigator.’’ The effects are found to interact 
with those pertaining to the ‘‘autistic hostility” hypothesis, in that the higher 
the level of postexperimental hostility revealed in the sketch the stronger the 
desire to discontinue the message sending in the no-communication group, 
and that initially hostile subjects, when compared with initially friendly 
subjects, initiate a smaller volume of communication during the preinstiga- 
tion period. In this study one is struck once again by the ingenuity with 
which previously unverified hypotheses about complex aspects of group 
functioning are being put to precise test in contemporary experimental re- 
search in group dynamics. 

Further light on the nature of interpersonal tensions in groups is given 
by Zajonc’s study (137) of aggressive attitudes of the ‘‘stranger’’ as a func- 
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tion of conformity pressures. Twenty Indian students of shorter residence in 
the United States were compared with 20 of longer residence for their attitu- 
dinal aggression against, and difficulty in conformity with, a number of 
American behavior patterns, such as scholastic competition, treatment of 
women, dating, kissing in public, etc. Results confirm hypotheses based on 
the sociological concept of the ‘‘stranger” and on the Dollard (31) frustra- 
tion-aggression sequence. Zajonc holds that the “‘stranger’’ needs to conform, 
is frustrated in this need by his super-ego bound in another culture, is there- 
upon instigated to aggressive response, is free to express his aggression be- 
cause of the permissiveness of his role as ‘‘stranger,”” and hence finds the 
instigation to aggression reduced. The frustration-aggression hypothesis, 
long prominent in social psychological theorizing, is extended in a needed 
direction by Pastore (99), who supplies evidence that the perceived “‘arbi- 
trariness’”’ of the frustration is significantly related to the amount of ag- 
gression elicited. 

Group productivity—On the continuing problem of the productivity of 
small groups, McCurdy & Lambert (83) describe a situation in which, on a 
problem requiring genuine co-operation, small groups do not work more 
successfully than do individuals. The authors explain the apparent discrep- 
ancy of these findings with earlier findings, such as those by Watson (129) 
and Shaw (111), where group performance exceeded that of individuals, by 
pointing out that in the present experiment the problem (levers to be pushed 
to turn on lights in a kind of “group maze’’) is such that the less alert and, 
less adequate individual member will necessarily interfere to some extent 
with the group progress, inasmuch as his contribution is as required as od 
of better members. In previous studies, on the other hand, the group tasks 
were such as to involve a cumulative product, or such as to reflect mainly the 
performance of certain group members who were superior in insight, reason- 
ing, etc., upon whom the burden of group responsibility tended to fall. A 
subsidiary finding of McCurdy & Lambert is that in their group situation 
neither a ‘‘democratic’’ nor an ‘‘authoritarian’’ atmosphere proves the more 
favorable in terms of group productivity. 

The relative superiority of groups over individuals in the learning of 
nonsense syllables is by no means clear-cut according to the work of Perlmut- 
ter & Montmollin (100), who find that although groups of three who learn 
and recall syllables by a ‘‘method of agreement”’ are superior to the average 
individual working alone, in many cases individuals surpass the group score. 
There appears to be evidence that individuals who have first participated in 
the group learning task do better when subsequently shifted to working 
alone than do individuals who have not had the benefit of the prior group 
experience. 

A methodologically and conceptually sophisticated study by Darley, 
Gross & Martin (27) throws light on some of the factors associated with 
group productivity. They arranged a contest among 13 university living 
groups, the goal being to prepare a “‘plan for better co-operative living’’ in 
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the co-operative housing project. The criterion of group productivity for 
each house was the over-all quality rating given by a committee of judges 
to the house’s report after a month long period of preparation. The predictor 
variables in the study were derived from five hypotheses regarding factors as- 
sociated with group productivity. All other things being equal, the produc- 
tivity of a group will depend upon the extent to which (a) the group goal is 
accepted, (b) the previous group experience has been successful and satisfy- 
ing, (c) appropriate use is made of differential skills within the group mem- 
bership, (d) leadership is both accepted and persistent, and (e) there are 
available within the group individuals of requisite ability or special skill. 
Rather than being content with a single datum relating to each of these 
hypotheses (a shortcoming common to many studies), these experimenters 
worked out 18 predictor variables pertaining to the five hypotheses, the 
data being gathered from a variety of sources, including participant-observa- 
tion, attendance records, autobiographical reports, sociometric measures, 
etc. Rank-order correlations (N=13) of the predictor variables with the cri- 
terion of group productivity are such as to lend support to all five of these 
hypotheses; for example, efficiency of house organization for the contest 
correlates .86, support of formal leader correlates .51, and enthusiasm for 
the task correlates .59 with the criterion. It is equally significant for more 
general implications of the conceptualization problem in group research 
that the several predictor variables which clustered under the heading of a 
given hypothesis did not necessarily show very high communality and did 
not all yield significant correlations with the criterion. As the authors remark, 
this ‘‘emphasizes the importance of careful derivations of behavioral coordi- 
nates in testing the relationships involved in group processes.” 

Related to the problem of group productivity is a validation of a socio- 
metric regrouping procedure reported by Van Zelst (127). Carpenters, formed 
into work teams according to their sociometric choices, show significant 
differences from a control group in job satisfaction, labor turnover, and labor 
and material costs. 

Israeli (64) describes a provocative study of group interaction in creative 
and critical activity; it is a topic that has attracted virtually no systematic 
attention in past research and that has tended to concentrate on group pro- 
ductivity having to do with logical problem-solving, quantity of output, 
and the like, rather than with the higher forms of creativity. Experimental 
use is made of a technique termed ‘‘method of serial production,” in which a 
creative work, such as a poem, an essay, or a drawing, is spontaneously and 
quickly constructed and criticized by one person after another in a chain-like 
order in a small group. The interest centers on the character of the social 
interaction between an artist and other artists, between a critic and other 
critics, and between artists and critics, and on the influence of such inter- 
actions on the nature of the final creative product. Experimental details are 
scantily reported and the results presented must be considered as merely in- 
dicative of the research potentialities of this area. Representative of the 
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reported findings of the study are: (a) that the initial step in a construction 
dominates later development, ‘‘a weak, ambiguous, or unstructured begin- 
ning hampers writers who follow” whereas ‘“‘a strong and coherent opening 
challenges and directs their work’’; and (0) that ‘‘creative performance seems 
to be blocked towards the end of a sequence” resembling “satiation in pro- 
ductive thinking as considered by Duncker.”’ 

The influence process in groups.—In a study involving a complex manipu- 
lation of conditions and variables, Festinger et al. (40) explore the influence 
process in groups containing extreme deviates in opinion. Subjects (of whom 
there were 443, in groups of six to nine) were required to take a position on 
the probable outcome of a labor dispute. All communications among group 
members were written and were actually constructed arbitrarily by the ex- 
perimenters to simulate the opinions of the other group members. The inde- 
pendent variables manipulated were: (a) high versus low cohesiveness, (b) 
degree of deviation (manipulated by giving the subject false information 
about the positions of other subjects'in the group), (c) presence or absence of 
alleged ‘‘experts,’’ and (d) the alleged availability or unavailability of the ac- 
tual outcome of the labor dispute. The dependent variables measured were: 
(a) readiness to change one’s opinion, (b) attempts to influence others in the 
group, and (c) redefinition of the boundaries of the group, as measured by 
lowered communication to deviate persons. 

In general, deviates are found to be more ready to change their opinions 
and show less confidence in their opinions. Those who change communicate 
less than others, that is, make less attempt to influence others; those deviates 
who do not change, tend to redefine the group boundaries by communicating 
only with those close to their own position and not with the majority farthest 
removed from their own position. The effect of high cohesion introduced into 
the groups is generally to accentuate the influence on the deviates. They at- 
tempt more to influence others in the cohesive group; the nonchanging de- 
viates redefine the group boundaries to a greater degree. Where the correct 
answer is supposed to ‘‘exist,’’ the amount of change is increased, and the at- 
tempts to influence others decreased. The major finding of the study in the 
words of the authors is, ‘‘The relative strength of the tendency to redefine the 
boundaries of the group in response to pressures toward uniformity in- 
creases for a member as the discrepancy between his opinion and the modal 
opinion in the group increases.” 

Striking effects of group pressure on the modification of individual judg- 
ments are reported by Asch (6), who employs a novel technique whereby all 
but one member of a group of subjects are confederates of the experimenter, 
instructed to report falsely on a simple visual judgment of comparative 
length of lines, the one remaining, naive subject being the only person actu- 
ally tested in the situation. By this means, experimental manipulation can 
readily be made of such variables as degree of discrepancy of the false group 
report from the correct judgment, size of the group, size of the majority 
which agrees on the false judgment, etc. Asch finds that even under these 
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conditions of highly structured stimulus material, many subjects are mark- 
edly influenced to go along with the group judgment. Apparently only a 
minor part of the effect is related to genuine perceptual distortion, most 
being ascribable to judgmental distortion or distortion of report. The size of 
the group can be reduced sharply to about three members without appreci- 
able loss in effectiveness of pressure, but the introduction of a single other 
person who gives the correct answer, thus breaking the unanimity of the op- 
position, largely eliminates the subject’s tendency to distort. 

Asch finds marked individual differences in responsiveness to the group 
pressure. Light on the nature of the personality correlates of such “‘independ- 
ence”’ and “‘yielding’’ is given by a study of Barron (7), using some of Asch’s 
subjects. ‘“‘Independents”’ and ‘‘Yielders’”’ are found to be equally stable in 
personality (as inferred from MMPI? responses), but to differ in values and 
self-descriptions. ‘‘Independents” more than “‘Yielders’’ tend to value the in- 
dividual as opposed to the group, to prefer complexity to simplicity, and to 
see themselves as original, emotional, and artistic, rather than obliging, op- 
timistic, efficient, kind, and determined. 

Kelley & Volkart (69) explore the resistance to change of group-anchored 
attitudes by measuring Boy Scouts’ attitudes about Boy Scout activities 
before and after hearing an adult read a paper criticizing these activities. 
Half were told that their answers were strictly private; half, the opposite. 
Three hypotheses were tested: (a) the amount of change in expressed atti- 
tudes produced by a standard communication contrary to the norm will be 
less under public than under private conditions (not confirmed); (6) when 
the influence to change is of standard strength, there will be an inverse rela- 
tionship between change and value of membership to the individual (con- 
firmed); (c) the foregoing relationship will be greater under private than 
under public conditions (confirmed). 

A quite different approach to the study of group influence on individual 
behavior is the experiment of Schachter & Hall (108) in which a speaker at- 
tempts to recruit subjects for a psychological experiment under four condi- 
tions of group-derived restraints. In order, from lowest to highest degree of 
restraint, these were: (a) half the group being secretly instructed to raise 
their hands; (b) the whole group being required to fill out a form indicating 
their interest in volunteering; (c) forms made available to anyone who wanted 
to take one and to be filled out then and there; (d) request of show of hands 
of those wishing to volunteer, forms being passed out to those with hands up. 
Follow-up contacts were made by a combination of letter and phone call. 
The results are that the proportion of persons volunteering is higher for the 
low-restraint conditions, but that these persons are less likely actually to 
show up. Thus, the two extremes of restraint condition are less economical 
than the moderate restraint condition. With low restraint, more volunteer 
but fewer show up; with high restraint, too few volunteer. 

A study of group influence more in the tradition of the older work on 
social facilitation and inhibition is that of Wapner & Alper (128), who find 
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that the presence of an audience increases the time for subjects to make 
choices among alternative words to follow phrases, an unseen audience hav- 


ing a greater delaying effect than a visible audience. The effect is accentuated 


for items having personal reference for the subject. 

Leadership and group influence-—Maier & Solem (84) treat of another 
aspect of the influence process in showing how under proper leadership con- 
ditions, minority opinions may be more effectively utilized in shaping the 
group’s thinking. Thirty-four groups of five or six persons each were supplied 
with a discussion leader while working on a logical reasoning problem. As 
controls, 33 groups were supplied with an observer. The leaders’ function 
was to facilitate the discussion; the observers simply observed. Neither 
leaders nor observers expressed their own views. Group members were 
asked to render individual answers to the problem before and after an 8-min. 
discussion. Beforehand, 45 per cent of the members of both experimental and 
control groups gave correct answers. After the discussion, correct answers 
in the groups with leaders rose to 84 per cent, as compared with 72 per cent 
in the leaderless groups. Further analysis revealed that the major part of the 
obtained difference was attributable to the relatively greater influence that 
individuals with minority opinions exerted in the leader groups. These re- 
sults are interpreted to mean that a discussion leader can function to up- 
grade a group’s thinking by permitting an individual with a minority opin- 
ion time for discussion. In a leaderless discussion, the majority is more likely 
to dominate and the potential contribution of the minority to be lost. 

In reviewing this study it is important to note that the problem was one 
calling for a logical solution. It would be of great interest to compare these 
findings with those on a problem where the solution was less clear-cut, or 
where the subject-matter of the group task consisted of attitudes and opin- 
ions rather than of matters of fact. 

The comparative effect of supervisory and participatory leadership on 
opinion change in small discussion groups is studied by Hare (53) in a repeti- 
tion of the Preston & Heintz experiment (102). The results confirm the earlier 
findings that participatory leadership is more productive of opinion change 
than is supervisory leadership, in which the leader does not join the discus- 
sion. It is interesting that the Preston & Heintz results were obtained with 
college students and the Hare results with groups of boys averaging 13 years 
of age, the leaders averaging 14 years. An incidental finding of some interest 
is that the supervisory leaders themselves showed the least change in opinion. 

Berkowitz (13) contributes an analysis of the phenomenon of leadership- 
sharing in small, decision-making groups, based upon a study of 72 confer- 
ence groups in business, industry, and government. Westerlund (131) reports 
an experimental study of different leadership methods in a Stockholm tele- 
phone exchange, a study fertile of hypotheses but inconclusive as to results. 

Leadership measurement and training—Maier & Zerfoss (85) describe 
‘multiple role playing’ (MRP) techniques as a training and research method 
in leadership. Maier (86) demonstrates that supervisors in industry, after 
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eight hours training in group-decision methods, are markedly more efficient 
in bringing about desired decisions by groups in an MRP test situation than 
are untrained supervisors. 

The leaderless group discussion situation has been shown to have some 
value in the assessment of leadership potential. On the basis of a statistical 
analysis of data from 67 such leaderless discussion situations, Bass, Klubeck 
& Wurster (9) examine the validity of the technique and arrive at certain 
implications concerning ways of increasing the validity. 

Considerable interest in dealing with the leaderless group discussion tech- 
nique has attached to the question of whether or not seating arrangement 
affects the emergence of leaders. Bass & Klubeck (8) report experimental 
results in the negative. 

With relevance to an interesting aspect of leadership attitude and be- 
havior, Eager & Smith (34) administered the Sanford Authoritarian-Equal- 
itarian Scale to 11 college women counselors in a health camp for under- 
privileged children. Later the children took a ‘‘Guess Who’”’ questionnaire, 
which provided data on their perceptions of the leadership style of the coun- 
selors. There is clear evidence that their impressions of the counselors scoring 
high on the Sanford A-E scale differed coherently from their impressions of 
those scoring low, thus providing one minor piece of validation of the A-E 
scale as a measure of leader behavior. 

Fleishman (43) offers a questionnaire developed for the objective de- 
scription of leadership behavior of supervisors in industry, providing meas- 
urement of two relatively independent leadership dimensions: ‘‘considera- 
tion for others’’ and “‘initiating structure.” 

Personal factors in leadershib.—Few new studies are available concerning 
the relation of personal factors to leadership. Martin, Gross & Darley (88) 
conducted an empirical study of 13 groups of girls living in a university co- 
operative village, among whom formal, informal, and mixed-type leaders 
were distinguished. Of approximately 100 personal variables available, only 
two differentiate these leader types: (a) higher level paternal occupation 
mark the formal leaders, and (b) better self-estimates of emotional develop- 
ment mark the mixed type leaders. All three leader types identify to a signifi- 
cantly greater degree with middle class standards than do the followers or 
group isolates, and all indicate greater experience in precollege family activi- 
ties. Gordon (47) presents results generally consistent with the implications 
of Stogdill’s earlier review (120), showing that women dormitory students 
nominated as corridor ‘leaders tend to be rated high on responsibility, as- 
cendency, and absence of hypersensitivity, while there is no similar relation 
with ratings of sociability and refinement. 


OPINION AND ATTITUDE CHANGE 


| The year’s work in the social psychology of attitudes is distinguished by 
| a swelling stream of excellent experimental studies of attitude change, es- 
pecially those concerned with the specification of conditions governing 


change. 
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As part of a program of research on the dynamics of attitude change, 
Jarrett & Sherriffs (66) compare the degree of influence on an attitude by 
arguments presented in three contexts: (a) an exposition frankly favorable 
to a particular attitudinal position, (b) a debate between apologists of oppos- 
ing positions, and (c) an impartial presentation of the two incompatible 
arguments. It is hypothesized that in the first context, expressed attitudes 
should change in the direction of the argument. In the second context, ex- 
pressed attitudes should move in the direction of the pre-existing bias, the 
amount of change depending upon the degree of emotional involvement in 
the issue. (The latter part of this hypothesis was not properly subject to test 
as a result of the lack of a satisfactory measure of emotional involvement.) 
In the third context, expressed attitudes should become more moderate in 
position. The results are on the whole consistent with these three hypotheses. 

Hovland & Mandell (62) report an experimental comparison of conclu- 
sion-drawing by the communicator and by the audience. A talk on currency 
devaluation was presented on tape. Two variables were manipulated: (a) 
conclusions drawn by the speaker, versus conclusions left unstated; (b) the 
speaker presented as a member of a class which would have a stake in the 
conclusions drawn, versus a speaker presented as impartial. Tests of opinion 
before and after the speech were given. The salient findings are that having 
conclusions drawn by the speaker is most effective in changing the audience’s 
attitude, that there tends to be greater attitude change for the impartial 
speaker situation, and that there is ‘‘no support for the hypothesis that 
drawing one’s own conclusions is particularly effective when one suspects 
the motives of the communicator.”’ Interesting incidental results are that 
the biased communicator is rated by the audience as having given a more 
one-sided presentation, though in fact his talk was identical with that of the 
impartial speaker, and that there is no difference in the factual information 
retained by the listeners for the ‘‘suspicion-arousing”’ and the impartial talks. 

The effects of different intensities of fear arousal in communications were 
explored by Janis & Feshbach (65) in a study of propaganda on dental 
hygiene among high school students. Illustrated talks inducing strong, mild, 
or minimal fear were given, with attitude testing one week before, imme- 
diately after, and one week after. The minimal fear appeal proved most ef- 
fective in eliciting resistance to subsequent counter-propaganda, and in elic- 
iting a higher degree of verbal adherence to a set of recommended practices. 
These findings are interpreted as supportive of the following hypothesis: 
‘‘When fear is strongly aroused but is not fully relieved by the reassurances 
contained in a mass communication, the audience will become motivated to 
ignore or to minimize the importance of the threat.’ Or, more generally: 
“Other things being equal, the more persistent the fear reaction, the greater 
will be the (acquired) motivation to avoid subsequent exposures to internal 
and external cues which were present at the time the fear reaction was 
aroused.” 

An interesting ‘‘sleeper’’ effect in opinion change is demonstrated by 
Weiss (130). In a standard learning situation, 228 subjects learned to asso- 
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ciate labels of ‘“‘true’’ or “‘false’’ with each of eight statements concerning 
effects of smoking. Some subjects were then exposed to a brief counter- 
communication discounting the truth value of the learned associations; others 
had no such counter-communication. Pre- and posttests on an opinion ques- 
tionnaire about the effects of smoking reveal a smaller loss in effectiveness of 
the original communication after six weeks for the group having had the 
immediate counter-communication, thus demonstrating a ‘‘sleeper’’ effect. 

Questions concerning the effect of repeating the same oral propaganda 
on the attitude of listeners are raised experimentally by Cromwell & Kunkel 
(26). Although their results do not clearly indicate a cumulative effect of 
such repetition after 30 days, the inadequacy of the experimental design 
renders the findings inconclusive. 

One more demonstration of the by now thoroughly verified fact that one 
critical source of constraint on the effectiveness of prestige suggestion is the 
degree of structuredness of the judged material is given by Moos & Koslin 
(92) who measure the influence effect of rankings of importance of state- 
ments taken from the Congressional Record allegedly made by Republican or 
Democratic leaders. Suggestion was exhibited on the vaguer statements, i.e., 
those appealing to broad norms, but not on the less vague. An excellent 
theoretical and empirical discussion of this whole matter of degree of struc- 
ture and ease of suggestion effect is found in Asch’s book (6), previously 
mentioned. 

Parrish & Campbell (98) find that radio transcriptions in monologue 
form do not differ in their opinion-change effectiveness from those in dialogue 
form, though the former are more frequently judged to be “‘propagandistic.”’ 
A methodological finding is that an indirect measure of propaganda effect, 
in the form of an information test, is a more sensitive indicator than the 
more usual direct attitude test. 

How the expression of an opinion varies with the person’s ‘‘definition 
of the situation”’ under which it is elicited is shown by Gorden (46). Private 
responses to an ostensibly anonymous questionnaire on opinions about 
Russia were compared with the same person’s public responses during an 
interview in the presence of other persons, all of whom were residents of a co- 
operative living project. Each person also gave his estimates of the group’s 
opinion on the issues. It is found that a person’s publicly expressed opinion 
tends to compromise the difference between his own private opinion and his 
estimate of group opinion. Case studies of extreme conformists and noncon- 
formists in this respect suggest that four factors may distinguish them: (a) 
degree of identification with the group, (b) conception of group attitude 
toward nonconformity, (c) own role in group, and (d) negativism. 

A fairly rare opportunity for measurement of effect of a dramatic news 
event on shifts of opinion is described by Taves (124). Before and after a 
multiple killing in a small town, intensive interviews of a cross section of the 
community were taken on a number of community issues. The killing ap- 
parently produced several shifts in opinion, generalized considerably beyond 
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the immediate character of the crime. More people questioned the ‘“‘good- 
ness’’ of the police department, the integrity of the elected city officials, the 
adequacy of welfare services, and the power of “‘influence’’ groups. 


OPINION-ATTITUDE METHODOLOGY 


Attitude scales and scaling—The literature on the Guttman and related 
methods of scale-analysis continues to grow. Stouffer et al. (121) contribute 
what they call the H-technique for building scales with increased precision. 
In their words, 


The method simply consists of determining a given cutting point in a Guttman or 
Lazarsfeld latent distance scale not by means of a single response, but rather by 
means of several responses, which are formed into a new “‘contrived item.”’ The object 
is to maximize the information available from the basic data, and hence to strengthen 
confidence in the scalability of the area under consideration and the generality of the 
dimension which the scale is defining, and to improve the ranking of individuals 
through reduction of scale error. 


On the problem of classifying nonscale response patterns in a Guttman 
scale, Henry (57) describes a simple method of placement, and Borgatta & 
Hays (18) point out some limitations on the arbitrary classification by the 
method of “‘perfect types,’’ stating what they believe to be a better solution. 

Schuessler (109) explores some problems of item selection in scale- 
analysis, concluding on the basis of an empirical study that an objective 
method of listing the content of the attitude area is needed. Two suggested 
procedures for this are: (a) to construct the universe operationally in terms 
of a content analysis of some body of material that lends itself to this 
method; and (b) to obtain from a random sample of people a complete verbal 
expression of the attitude being studied. The author also points out that 
categories contrived in the method of successive approximations are not 
necessarily homogeneous. A simplified and improved version of the Guttman 
scalogram board is recommended by Marder (87). 

Hovland & Sherif (63, 112) provide new light on certain judgmental 
phenomena in connection with the derivation of Thurstone-type attitude 
scales. They present evidence that those persons whose judgments are 
skewed tend to hold extreme views and that arbitrarily throwing out their 
judgments restricts the range of the scale. Also, there is evidence that the 
more emotionally involved judges (e.g., Negroes on a scale of attitude toward 
Negroes) use a more constricted scale, and that neutral items are displaced 
away from the subject's own position. 

Riley & Toby (105) make an interesting proposal concerning the scaling 
of social objects (‘‘object scales’’), as differentiated from the usual scaling of 
persons (‘‘subject scales’), which would provide a systematic empirical ap- 
proach to the concept of group definition of objects, and their ordering along 
various dimensions. They say, ‘‘Subject scales measure the extent to which 
individuals share a common component. Object scales on the other hand may 
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result from group definitions which are contributed to in different ways by 
different group segments.” 

Survey methodology—Methodological developments in survey research 
continue along numerous fronts, such as methods of data collection, prob- 
lems of coding, sampling, interviewer bias, validity of interviews, etc. 

Metzner & Mann (90) address themselves to a study of the persistent 
problem of comparison of the fixed-alternative questionnaire and the open- 
ended interview. They discover marked discrepancies in personal background 
data as obtained by the two instruments. The interview yields a significantly 
higher percentage of electrical utility employees expressing satisfaction with 
their work than does the anonymous questionnaire. White collar employees 
show greater consistency on the two forms than do other employees. 

The comparative merits of field coding and office coding are examined by 
Woodworth & DeLott (136) with somewhat inconclusive indications that 
the two methods do yield differences in frequency of use of precoded cate- 
gories and in frequency of ‘‘don’t know” responses, fewer of these being ob- 
tained from field coders. 

On questions of sampling methodology, Alpert (4) points out qualifica- 
tions in the use of quota, cluster, and probability sampling deriving from 
certain sociological factors bearing on grouping of people. Maslow & Sakoda 
(89) examine volunteer-error in the Kinsey study with the finding that 
volunteers are higher than nonvolunteers on a measure of “‘self-esteem”’ and 
that ‘‘self-esteem”’ correlates significantly with unconventional sex behavior. 
The specific implication is that sex studies using volunteers as a sample re- 
quire correction for volunteer-error, and this has more general implications, 
of course, for all kinds of studies using the volunteer sample method. 

Procedures for detection and control of interviewer bias are suggested by 
Ferber & Wales (39). The sending of letters forewarning respondents of the 
specific topics to be covered in a coming opinion poll does not produce ap- 
preciable differences in answers obtained in the poll [Nuckols (96)]. Ervin & 
Bower (36) give some practical advice on the increasingly important techni- 
cal problem of language translation in international surveys. 

In the sadly underdeveloped area of validation of survey methods, Lar- 
sen (76) contributes an interesting experimental study of the comparative 
validity of telephone and face-to-face interviews on the measurement of 
message diffusion from leaflets. The Washington Public Opinion Laboratory 
had leaflets dropped which requested, among other things, that the recipients 
mail them in. The relevant areas were then interviewed by telephone and by 
face-to-face interviews. The telephone sample was found to show more exag- 
geration in their report of what they had done, pointing to the necessity for 
especial caution in interpretation of data obtained from telephone interviews. 

Another aspect of validity of interviews is treated by Miller (91), who 
describes the construction and validation of several question batteries de- 
signed to predict voter turnout in a Congressional election in Waukegan, 
Illinois. Before and after election interviews were conducted with a carefully 
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drawn probability sample, whose reports on voting behavior were objec- 
tively checked against official voting records. Seventy-two items were ana- 
lyzed as to their predictive power and several batteries constructed on an 
empirical basis. The best battery still resulted in inclusion of at least 15 per 
cent nonvoters, and even this represents some overstatement of validity of 
the battery, inasmuch as no cross-validation is available. 

Reviews are available of polls and survey research programs in France 
[Dorsey (32)], Western Germany and West Berlin [Kellerer (68)], Holland 
[De] ong (28)], and in agencies of the United States government [Alpert (3)]. 


COMMUNICATION AND RUMOR 


Communication theory and research.~—Deutsch (29) discusses communica- 
tion models in the social sciences, pointing out that the functions of such 
models are ‘‘the organizing, the heuristic, the predictive, and the measur- 
ing,’’ and that the adequacy of a model can be evaluated in respect to criteria 
of originality, simplicity, and realism. On a more concrete level, Eisenstadt 
(35) draws a number of theoretical implications about the communication 
process from a study of 250 new immigrant families being absorbed in Israel. 
One of the main functions of communication within the social system is to 
maintain effective performance of social roles inherent in it, thus insuring 
full social participation and identification with its values. The individual is 
most receptive to those communications which help him to satisfy various 
needs and specifically to perform different status-conferring roles. In general, 
“the communications concerning the technical aspects of new social roles are 
more effectively transmitted than are those relating to institutional values 
and symbols.” 

In another field study of communication as observed within a real life 
setting, in a rural Greek village, Stycos (122) finds that in accordance with 
the ‘“‘opinion leader” hypothesis, the importance of the opinion leader is en- 
hanced in an area of low literacy and meager access to mass media, and his 
role as opinion leader somewhat formalized. Moreover, “information con- 
trollers’’ who have access to radios may emerge in positions to control the 
flow of news. 

In laboratory experimentation on the communication process, Grace (49) 
reports on the effects of different degrees of knowledge about an audience on 
the content of communication. The task was for the subject to study a set of 
objects on a table for two minutes, then report on them to an audience con- 
sisting of another person. This was done under three conditions: (a) without 
knowledge of the sex of the audience, (b) with knowledge of the sex of the 
audience, and (c) with careful briefing about the sex of the audience. Both 
male and female undergraduates served as subjects, and in each of the three 
conditions half reported to a male audience and half to a female audience. It 


4 Communication theory, in its more specialized sense of information theory, psy- 
cholinguistics, etc., is treated by Dr. Miller elsewhere in this volume. 
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appears that the selectivity of content of a person’s communications under 
these conditions is influenced both by his own values and by a clear under- 
standing by him of the nature of the audience to which he will communicate. 

A special issue of the Public Opinion Quarterly devoted to problems of 
international communications research is edited by Lowenthal (82), with 
contributions by Lazarsfeld (79), Lasswell (77), and others. Of particular 
interest are White’s (132) discussion of the problem of resistance to interna- 
tional propaganda among both friends and foes, and a symposium on content 
analysis for the Voice of America. 

Spiegelman, Terwilliger & Fearing (117) study the content of comic 
strips as an important mass medium of communication. Their most interest- 
ing finding is that there is an inverse relationship between reality of depicted 
situation and of characterization; if the situation is ‘‘real,’’ the actors are 
caricatured; if it is ‘“‘irreal,’’ the actors will be treated more realistically. 

A semipopular account of propaganda analysis, public opinion, and mass 
communication, which utilizes surprisingly little of the more recent psycho- 
logical insights and findings on these topics, has been written by Lee (80). 

Berelson’s Content Analysis in Communication Research (12), an extensive 
revision of an earlier account, is an exceptionally useful discussion of the 
definition, rationale, uses, and methods of content analysis. 

Rumor.—Dodd (30) reports progress on a large-scale experimental pro- 
gram to study rumor, with emphasis upon social factors regulating rate and 
other characteristics of message diffusion. An attempt is made to give a for- 
mal analysis of message diffusion, though question remains whether much 
is to be gained from such a formalization. A suggested refinement of the 
Allport & Postman “‘law of rumor” (2) is offered by Chorus (24) who points 
out the relevance of a factor of insightfulness by the person in modifying the 
tendency to transmit the rumor. 

A case report of rumor transmission in a catastrophic situation (the 
Darjeeling landslides of June, 1950, in India) is presented by Sinha (114). 
The data provide support for the view that transformations tend to be in the 
direction of exaggeration. Rather pronounced distortions in immediate per- 
ception by participants are also reported. 


Socio-PoLiTICAL ATTITUDES AND PARTICIPATION 


The most interesting of the year’s studies of socio-political attitudes have 
to do with their personal determinants, especially in relation to the variable 
of liberalism-conservatism. That liberalism-conservatism can properly be 
considered a unitary dimension is challenged by the study of Kerr (70) who 
intercorrelated measures of political, economic, social, religious, and aesthetic 
liberalism on 246 university men; he found these five dimensions to be rela- 
tively independent of one another. Using the separate liberalism measures, 
data are presented to show that Republican Party membership is associated 
with political liberalism and economic conservatism, and that Roman 
Catholics, Protestants, Jews, and those without church affiliation fall in that 
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order along the scale from religious conservatism to liberalism. Within the 
political liberalism test itself, a cluster analysis of items indicates the pres- 
ence of three relatively separate factors: empathy with elected personnel, de- 
votion to civil liberties, and participation tendency. As is true of many simi- 
lar correlational studies in the field of attitudes, serious questions must be 
asked about the representativeness of the sampling of subjects as well as of 
the items included in the questionnaires. University students may well mani- 
fest such differentiations of sentiment about issues of liberalism-conservatism 
but the same may not be true of other, less highly selected populations, for 
whom a more generalized liberal or conservative point of view may exist. 

In a sample of British students, Sanai (107) reports significant positive 
correlations of ‘‘alterationism’’ (radicalism) with emotionality and intro- 
version, and with Theoretical and Aesthetic values on the Allport-Vernon 
test; with Religious value the correlation is negative. 

Helfant (56) finds that adolescents’ international attitudes (e.g., attitude 
toward Russia) are more closely correlated with the attitudes of their parents 
than with their own feelings of outwardly direct hostility, as measured by 
the extrapunitive scores on the Rosenzweig Picture-Frustration Test and on 
related techniques. The obtained correlations between adolescent and paren- 
tal attitudes are smaller than previously reported, due perhaps to the fact 
that in this study a more valid appraisal of parental attitudes was achieved 
than is often the case in such studies. 

In a study handicapped by inadequate design and sampling, Dreger (33) 
attempts to compare extremes of religious liberalism and conservatism, 
measured by a Salvation Opinionnaire, in respect to several measures of 
personality, such as emotional maturity, rigidity, emotional control, de- 
pendence need, etc. By and large the results are negative, 30 church going 
“liberals’’ not differing in any of the generally hypothesized ways from 30 
church-going ‘‘conservatives.’’ However, it must be noted that these com- 
parisons are within church-goers; religious ‘‘liberals’’ who are not church- 
goers might show differences not tapped in the present comparison. 

A preliminary presentation of some of the data being gathered by the 
Appeals of Communism Project at Princeton University is made by Krug- 
man (72). Intensive interviews with 50 former members of the American 
Communist Party, divided into two groups, middle-class intellectuals (jour- 
nalists, writers, artists, professionals, etc.) and trade union officials, reveal 
that the intellectuals are more likely than the union officials to have had 
emotional adjustment problems prior to joining, to have been attracted by 
the opportunities to defy authority, to have been status-anxious, and in gen- 
eral to have satisfied their unconscious needs through the Party. Such a 
study suffers, of course, from the inherent weakness of its necessary method, 
that of retrospective reporting by the ex-Communist on the motivational 
factors involved in joining. A shortcoming that could be remedied is the lack 
of comparative personality data on matched control groups of middle class 
intellectuals, and trade union officials, who had not joined the Communist 
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Party. Such data would help clarify essential differences which appear to 
exist between the two groups compared in this study. 

Political participation.—Personality correlates of political interest and 
political apathy are examined by Mussen & Wyszynski (93) in a study of 
156 undergraduates, divided into two extreme groups on the basis of ques- 
tioning concerning their present and anticipated political participation. The 
apathetic and interested groups do not differ ideologically, as measured by 
the A-S, E, F, and PeC Scales of The Authoritarian Personality study (1). 
But differences are found on a number of projective questions designed to 
elicit personality attributes previously hypothesized to relate to extent of 
political participation, for instance by Riesman & Glazer (104). Those at the 
active extreme tend to be self-accepting of their own emotions, conflicts, and 
feelings, concerned with emotional and intellectual expression rather than 
with conventional values and standards, and outwardly oriented toward 
others and their welfare. Those at the politically apathetic extreme, on the 
other hand, tend to be fundamentally hostile yet unable to accept their 
hostile impulses, and as a result appear submissive to authority, over stress- 
ing of conformity, incapable of enjoying deep emotional experience, and in 
effect unable to become genuinely concerned with others or with the general 
welfare. In general, these differences lead the authors to conclude that 


political apathy and activity are specific manifestations of more deep-lying, and per- 
vasive passive and active orientations. . . . Political activity or apathy are not func- 
tions only of the general social structure and current historical events. The personality 
of the individual operating within the socio-historical context must also be considered. 


While the foregoing formulation is provocative, one is struck by technical 
limitations of the study, especially the choice of subjects, the lack of means 
of getting at actual political participation behavior rather then mere expres- 
sion of intention, and the quite modest size of the personality differences 
found. 

Added light is thrown on determinants of participation by the study of 
Gough (48) in connection with the development of a scale to predict ‘‘social 
participation” among high school seniors. The criterion of participation was 
the number of extracurricular activities in which the student was involved. 
Extreme highs and lows in participation were compared in responses to a 
large battery of specially devised items, and a final scale of discriminating 
items was refined and validated. Study of the diagnostic items reveals that 
the more participative students tend to be characterized by greater candor 
and frankness, self-discipline, tolerance of others, and a sense of identification 
with and acceptance by the group. Although it is social participation of a 
particular sort which is here involved, it is presumed that these results have 
some significance for the problem of participation in general, including politi- 
cal participation. In this light, it is of interest that the kinds of personality 
differences isolated do provide some support for earlier views about the psy- 
chological determinants of participative action. 
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Robinson’s (106) further analysis of data from The People’s Choice 
[Lazarsfeld et al. (78)], involving a factorial analysis of different components 
of participative political activity, uncovers three independent factors: (a) spec- 
tator interest, indicated by tendency to read about the campaign, etc., but 
not necessarily related to basic involvement or voting behavior; (0) citizen 
interest, indicated by above average voting tendency, but with no particular 
attention toward influencing others; and (c) partisan interest, indicated by 
primary concern with influencing others regarding the issues and candidates. 

Wiebe (133) evaluates the actual influence of watching the televised 
Kefauver crime hearings on people’s political participation. A study of a 
sample of 260 New Yorkers showed that, although the problem was broadly 
perceived, the hearings did not stimulate the individuals to engage in con- 
structive problem solving behavior. Wiebe relates the findings to a concept 
of ‘‘social impotence’ which has bearing on reference group theory. 

A formulation of hypotheses concerning socio-psychological factors de- 
termining the difference between intention to vote and actual voting is of- 
fered by Suchman (123), with attention both to factors relating to the cam 
paign situation itself and to personality factors in the individual voter. 


ETHNOCENTRISM AND PREJUDICE 


A steady stream of studies of ethnic prejudice continues to flow, focussed 
primarily on the determinants and correlates of prejudice, the effect of per- | 
sonal contact on prejudice, the relation of expression of prejudice to overt \ 
behavior, and the comparison of prejudice among various groups. To this | 
reviewer there seem to be few truly outstanding developments in the field 
during the year’s work. The studies are mainly extensions of lines of inquiry 
previously set in motion, rather than significant departures or innovations in 
theory and method. 

Determinants and correlates of prejudice——In an investigation of personal 
values as factors in anti-Semitism, Evans (37) gave the Allport-Vernon 
Study of Values and the Levinson-Sanford Anti-Semitism Scale to college 
students. In general accord with his hypothesis, it is found that the value- 
patterns of those high in anti-Semitism stress political and economic values, 
whereas for those low in anti-Semitism stress is on aesthetic and social values. 
Another evidence of relation of personality factors to ethnic attitudes is pre- 
sented by O’Connor (97), who finds among a sample of 77 Harvard under- 
graduates that scores on the Ethnocentrism Scale are positively associated 
with “intolerance of ambiguity’”’ and with poor ability to solve abstract 
reasoning problems. In a similar vein, Block & Block (16) find that ethno- 
centrism (measured on the Berkeley Ethnocentrism Scale), submission to 
authority suggestion in continuing a spool-packing task after the subject is 
“satiated,’’ and rapid establishment of a norm in autokinetic judgment, all 
cluster together and may be inferred to reflect a compulsive ‘‘structuring” 
approach to a somewhat unpredictable environment. 

Frenkel-Brunswik & Havel (44) report a study of 81 white American 
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Gentile children, 10 to 15 years of age, interviewed about their attitudes 
toward five minority groups: Negroes, Mexicans, Chinese, Japanese, and 
Jews. The children varied from extreme highs to extreme lows on the Ethno- 
centrism Scale. Prejudice, as measured by this scale, correlates .67 with an 
over-all interview rating on general prejudice, and intercorrelations ranging 
from .69 to .97 are found among ratings on prejudice teward the various 
minority groups. These findings give some (though far from conclusive) sup- 
port to the notion of a generalized prejudice factor in the children. A tend- 
ency is found for the high ethnocentric subjects to dichotomize, to exaggerate 
adherence to conventional values, to avoid insight into the characteristics of 
the in-group, and to feel fear of threats and danger from the out-group. The 
authors see some distressing social implications in their results: 


The children’s attitudes are usually a mixture of prejudice and tolerant ideologies. 
Our high-scoring subjects, on the whole, were freer in expressing prejudice than the 
low-scoring children were in expressing tolerance. Furthermore, the middle scorers 
who are more representative of the bulk of the population show great similarity in 
their responses to the high scorers. Thus the attitudes of the children as a group are 
predominantly prejudiced ones. 


Hofstaetter (59) criticizes studies such as the above and others which 
reflect the influence of The Authoritarian Personality, as overemphasizing 
personality factors in prejudice at the expense of important sociological fac- 
tors. On the basis of what appears to be a technically faulty factorial study 
of responses to a short attitude questionnaire, he concludes that there is no 
justification for including such independent dimensions as anti-Negroism, 
anti-Semitism, national pride, Puritanism, etc., in a single type, the so-called 
‘authoritarian personality.” 

Contact and prejudice-——Observations of the relationship of intergroup 
contact and racial attitudes in such diverse settings as a Utah mining town, 
the Pocahontas coal field, department stores, and public housing projects, 
can be found in Harding (52). One of the studies included is especially notable 
both for its methodological care and its real life setting. Wilner, Walkley & 
Cook (134) analyze residential proximity and intergroup relations in public 
housing projects. Intensive interviews with large numbers of white house- 
wives provided data for assessing the amount and kind of contact with 
Negroes and the relation of degree of proximity to attitudes toward Negroes. 
Marked proximity influences are found, thus providing 


strong support for a hypothesis that has gained increasing acceptance in recent years: 
that contact among racial groups of equal or nearly equal socio-economic class and 
status-roles is a favorable condition for the modification of ethnic attitudes. The data 
from these studies indicate that this is particularly likely to be the case in situations 
where the social climate supports such contact. 


Bird, Monachesi & Burdick (15) find that in a neighborhood undergoing 
infiltration by Negroes, white residents express markedly varied racial atti- 
tudes. The majority are ambivalent in attitude, feeling both inability to over- 
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come what they recognize as “unfounded prejudice” and resentment that 
the proximity of the Negroes causes them to become aware of hitherto sup- 
pressed feelings of prejudice. It appears that one consequence of the lack of 
clarity and uniformity of the adult attitudes is that the white children are 
not afforded a consistent, clear set of evaluations of Negroes, the result being 
that their attitudes do not closely resemble their parents’ attitudes. 

Expressed attitudes and overt behavior.—Along the lines of the classic study 
of LaPiere (75) on relation of attitude expression to overt behavior, Kutner, 
Wilkins & Yarrow (73) find that restaurants when queried by mail or by 
phone about reservations for a party including both whites and Negroes at- 
tempt to avoid making such reservations, yet when parties of white and 
Negro women together actually appear at the restaurant, they are regularly 
seated and served. 

To evaluate the actual place of attitudes toward minority groups in the 
larger life-space of the residents of a small midwestern community, Brook- 
over & Holland (21) used three techniques of assessing attitudes: an attitude 
scale, a formal interview, participant observation. They find that the adults 
of the community, when directly asked, express consistent and highly un- 
favorable attitudes toward minority groups; yet at the same time in normal 
everyday conversation, relations wit minority groups are not at alla common 
topic, nor are minority groups frequently mentioned in response to non- 
directive questions. The inference is that minority group attitudes are not 
salient in this community and that answers to a direct questionnaire may not 
reflect the true attitudinal situation. 

Comparative prejudice among whites and Negroes—Three studies throw 
limited light on the matter of comparative prejudice among whites and 
Negroes. Prothro & Jensen (103) supply data on attitudes of Negro and white 
college students in the deep South with respect to the Church, Jews, whites, 
and Negroes. Contrary to previous findings, their results do not support the 
view that the attitudes of whites and Negroes are highly similar. In the same 
vein, Gray & Thompson (50) find that Negro students in Georgia consistently 
show more ethnic prejudice than white students as measured by the Bo- 
gardus Social Distance Scale. On the other hand, a subsidiary finding of the 
Bird, Monachesi & Burdick study (15) mentioned previously is that in a 
neighborhood where whites and Negroes both live, the white and Negro chil- 
dren are remarkably alike in their expression of attitudes toward Jewish 
children. 


SocrIAL PERCEPTION 


Bieri (14) reports experimental confirmation of the hypothesis that in a 
constructive interaction situation, one’s perception of another will change in 
the direction of increased similarity to oneself. Pairs of students who were 
strangers interacted in a discussion of the course and discussion of a hypo- 
thetical vacation together. Before and after the interaction situation, each 
subject predicted the other’s responses on a modified Rosenzweig Picture- 
Frustration Test. 
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The influence of the individual’s value-system on his perception of people 
is investigated by Fensterheim & Tresselt (38). The objects of judgment were 
24 pictures of men and women. The subjects rated each picture on a like-dis- 
like dimension and also assigned to each that one of six statements drawn 
from the six Allport-Vernon values which best described the picture and that 
one which least-well described it. The hypothesis that the perceiver’s own 
major values will be projected in his perception of liked pictures is confirmed. 
The liked stimuli are judged to have the subject’s own high values. A further 
hypothesis that the subject will use his own major values as ‘‘anchoring 
points” for judging the pictures is not confirmed; rather, his own low values 
tend to serve as anchors. The interpretation offered is in terms of the 
“strangeness” of the stimuli. 

On the other hand, evidence that perception of other people’s values is 
by no means entirely a matter of projection is shown by Precker (101). All 
242 student members of a college community ranked 39 value-criteria as 
pertaining to themselves. In addition, sociometric choices were made of peer- 
associates and of near-authority figures (advisors). The subjects tend to 
choose persons in both categories whose value-patterns objectively resemble 
their own. This tendency is most marked in the case of reciprocal socio- 
metric choices. 

Hastorf & Bender (54) emphasize the need for theoretical and operational 
differentiation of empathy and projection in the judgments of other people. 
Each of 50 students took the Allport-Vernon test of values and also predicted 
the value scores for another student he knew well. Two deviation scores were 
obtained: one, the difference between predicted and true value score (em- 
pathy); the other, the difference between predicted and self-score (projec- 
tion). For 20 of the subjects predictions were closer to the true scores of the 
others; for 28 the predictions were closer to own scores. The authors charac- 
terize the above two types of subjects as “‘empathizers” and ‘‘projectors,”’ a 
distinction the value of which will remain unclear until some sharper way to 
differentiate the types can be found, and validity tested. 

Fiedler, Warrington & Blaisdell (42) find that fraternity men perceive 
fellow group members they like best to be more like themselves than fellow 
group members they like least, although there is no such objective relation- 
ship found between the self-descriptions, obtained through a Q-sort pro- 
cedure. 

Following along lines of thought emerging from the Berkeley studies of 
the “‘authoritarian personality,’’ Scodel & Mussen (110) test the hypothesis 


that authoritarians, because of their lack of insight into others and their need to con- 
sider themselves members of the in-group, would perceive non-authoritarian peers 
to have attitudes and personality characteristics similar to their own. Non-authoritar- 
ians, on the other hand, being more insightful and more individualized in their ap- 
proaches to others, would judge authoritarians accurately with respect to attitudes 
and personality characteristics. 
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Twenty-seven pairs of subjects, each consisting of a person scoring high 
on the F-scale (authoritarian) and a person scoring low (nonauthoritarian) 
interacted in a brief neutral discussion session. Afterwards each member 
tried to predict how his partner would answer the F-scale items and 30 
selected MMPI items. The results are taken to confirm the hypothesis, inas- 
much as the highs do not perceive lows as having F scores significantly dif- 
ferent from their own, whereas lows do ascribe to the highs F scores that are 
significantly higher than their own. Some reservations may be entered, how- 
ever. For one thing, the lows do only slightly better than the highs in ac- 
curacy of prediction of the partner’s MMPI responses, and this indicates a 
limitation upon the hypothesized differences in perceptual functioning in the 
two groups. For another thing, the lows may not be more accurate in their F 
score prediction, but may merely express a stereotype of their own that 
others are more ‘‘authoritarian’”’ than themselves. Needed controls could be 
introduced by also having the lows judge lows and the highs judge highs. 

Far off the beaten track of most of the foregoing studies of social percep- 
tion, which primarily concern themselves with the mechanism of projection 
in perceiving others, is a study by Wittreich (135) on the so-called ‘“‘Honi 
phenomenon.”’ Married persons looked at their spouses and at strangers in 
the two windows of the Ames’ distorting room and noticed that the distortion 
in size of the spouse was less than in the case of the stranger. In a check 
experiment in which the person being judged walked from one corner of a 
large distorting room to the opposite corner, changing perceptually from 
small through normal to large in size, the above finding is verified, namely, 
that the marital partner is perceptually more stable in normal size than a 
stranger. The finding is a remarkable one and because of its theoretical im- 
portance should be further investigated with a variety of needed controls, 
especially those concerned with determining whether the phenomenon is 
essentially judgmental rather than perceptual and the conditions which 
favor or disfavor the effect. ' 

In a curious experiment by Cofer & Dunn (25) evidence is found that 
previous reinforcement of the words “hot” or “‘cold”’ for two groups of sub- 
jects significantly influence the way they rate related personality traits of 
photographed persons. The finding may have implication for the kind of 
earlier studies of social perception carried out by Asch (6), Kelley, and others. 


SocIAL PSYCHOLOGY AND THE ATOMIC AGE 


In accepting the fifth Kurt Lewin Memorial Award, Myrdal (94) dis- 
cusses some of the psychological impediments to effective international co- 
operation. Bauer (10) writes an enlightening account of basic changes in the 
Soviet conception of human nature, brought about as a function of ideologi- 
cal, practical, and political pressures, a book which is both an essay in psy- 
chological history and a study in political control. 

But aside from such rather general exceptions, it can not escape attention 
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that this review of the year’s work in social psychology includes little that is 
aimed directly at the deepest social issue of our atomic age—international 
conflict. 

Perhaps this is as it should be. It may indeed be true that in our longer 
perspective (assuming that such shall continue to exist), the greatest con- 
tribution that scientific social psychology can make to the problem of inter- 
national conflict is in increasing our understanding of the concrete processes 
of group dynamics, attitude change, prejudice, and social perception, to 
which most of the year’s research has been devoted. 
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INDUSTRIAL PSYCHOLOGY’ 


By ALASTAIR HERON? 


Unit for Research in Occupational Adaptation, Medical Research Council, 
London, England 


It may well be that in the last five years we have experienced the end of 
an era in the history of industrial psychology. No startling development 
took place, no text appeared to establish a landmark, no new theory pro- 
voked widespread discussion and opened fresh vistas; but perhaps something 
less sensational may be detected. Discontent can sometimes be divine, 
provided it results in thinking which ultimately issues in more appropriate 
activity. Such a kind of discontent has appeared among psychologists con- 
cerned with the application of their scientific insights and skills to the prob- 
lems of an industrial society. Some of the necessary thinking seems to have 
been done, and a little of it has been evidenced by changing activities. 

Editorial policy and critical comment are agreed that the task of those 
called on to write these annual chapters is mainly ‘‘to derive some kind of 
order, even if only provisional, from the chaotic mass of current publications; 
to provide context, note trends, and as far as possible assess significance”’ 
(54). Consequently, though the literature has been exhaustively examined, 
no attempt is made to report more than a few of the books and articles ap- 
pearing during the year. Those referred to have been selected mainly on 
the criteria of quality, interest, or novelty, but a few are added more as 
warnings than as models. Quality is not always measured in terms of ex- 
perimental or statistical elegance, but it must be said that many reports 
have been passed by for gross defect in these respects. Interest has been 
interpreted in as catholic a manner as the limitations of the reviewer per- 
mitted, and novelty may often mean little more than a promising variation 
upon an old and familiar theme. 

In order to set in perspective the published output of a single year it 
first seems desirable to stand back and take a quick look at our subject. The 
resulting selection appears so striking when kept in strict chronological order 
that no attempt has been made to organize it in other ways. More than 25 
years ago it was already agreed that one of the functions of the British In- 
dustrial Health Research Board was ‘‘to advise the Medical Research Council 
upon the best means of securing the fullest application of the results of (its 
research) work to the needs of industry.” By 1928 it was recognized that the 
objectives of industrial psychology were to acquire understanding and con- 


1 The period fully covered by this review ends at March 31, 1953, but a few ad- 
ditional books and journals were seen in April, 1953. 

2 The reviewer is very grateful to Professors Aubrey Lewis and L. S. Hearnshaw, 
and to Drs. H. J. Eysenck, C. B. Frisby, and Stephen Wiseman, for their critical 
comments upon a first draft of this chapter. 
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trol of the relations between the worker on the one hand, and his work, his 
work-mates, his supervisor, and his employer on the other. At that time the 
emphasis was explicitly stated as ‘“‘being placed on the individual rather than 
on the group.”” Weber, Viteles, and others were already drawing attention 
to the criterion problem and the issue about the ‘‘one best way”’ had been 
stated by C. S. Myers and others. In 1930 Myers reminded psychologists 
in a characteristically balanced way of their twofold responsibility to sta- 
tistical procedures: 


Nothing is more important than that the experimental psychologist (sic!) should be 
well-grounded in the theory and practice of statistical measurement. But at the same 
time nothing is more important than that he should know when and how to use this 
statistical knowledge and skill, employing them not merely mechanically and mathe- 
matically but with due regard to psychological considerations. 


In 1932 Viteles felt obliged to draw attention to the tendency of industrial 
investigators to duplicate testing devices ‘‘at a time when the more complete 
evaluation and standardisation of existing apparatus and techniques are 
most needed.’’ He was able to quote Kornhauser’s 1930 challenge, ‘‘Among 
the few greatest questions of our age is that which asks what modern industry 
means to the individual worker with reference to his satisfactions and fullness 
of life,” and to conclude his still classic text (112) by placing an unerring 
finger upon the crucial role of the business executive. 

Fifteen years of economic depression and war with their consequences 
bring us, in 1947, to Maier’s recognition that ‘‘the most undeveloped aspect 
of industrial progress is management of labour power, an aspect which must 
be considered seriously in the scientific age’’ (79). A year later, Ghiselli & 
Brown stated the new emphases of industrial psychology to be implicit in 
an aim of ‘‘maximum production consistent with the abilities, energies, 
interests and motives of the worker” (43). By 1950 Shartle (99) could point 
out that the Armed Services work of psychologists had been ‘‘well done and 
well sold’’ but “short in criteria’ and with “insufficient emphasis on leader- 
ship.’’ Quoting Wagner, Shartle redirected attention to the interview and 
the problem of integrating the data obtained. A year later Bellows (12) 
could see an increase in leadership studies and the use of the leaderless group 
discussion technique. In 1952 Brown & Ghiselli (18) were expressing anxiety 
about a large number of articles on tests (echoing Viteles just 20 years 
before) and stressing the current need for more work on placement and 
training and for psychologists to know industry and report their work 
accurately and adequately. They noted the continuation of ‘‘the relatively 
recent interest ...in group problems” and a trend towards “the develop- 
ment and use of personality tests” in selection work. Last year Harrell (51) 
noticed the increased popularity of studies concerned with ‘the effectiveness 
of training supervisors in human relations” and with “‘the relations of super- 
visors and employees.” 

As we pass on now to glance briefly at a highly selected sample of recent 
endeavour, much will seem familiar: sometimes because it has been going on 
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for 30 years; sometimes because 20 years ago our elders had seen it to be 
important, but we have only just got round to it; and sometimes because it 
is in line with current fashion. In such a panorama of familiarity, it may be 
relatively easy to note anything strange and new. 


EUROPEAN WoRK 


A useful glimpse of a cross section of the work of European psychologists 
is available in Applied Psychology, the Proceedings of the 1951 International 
Congress of Psychotechnics (33). It contains summaries, varying in length, 
of 40 papers delivered by psychologists from Italy, France, Sweden, Ger- 
many, Yugoslavia, Belgium, Portugal, Austria, Norway, Switzerland, Great 
Britain, Denmark, and Holland. Heterogeneity of methodological sophistica- 
tion characterises the general picture. Many papers reflect the continued 
disregard of statistical evaluation and sound experimental design which is by 
no means confined to the Continent. Scandinavian papers evidence an 
increasing interest in both the ‘‘testing’’ and the “human relations” ap- 
proaches. 

The titles of four French journals (21, 22, 88, 107) are listed for conven- 
ience in the bibliography at the end of this chapter. Their contents reflect a 
trend toward the more refined statistical methods such as factor analysis, 
discriminant function, and sequential analysis. Psychomotor tests, many of 
them ingenious, still dominate the selection field, but the close attention 
paid to work done in the United States and Great Britain is likely to lead 
to the use of objective personality tests as these become available. Attention 
is also drawn to La Revue Suisse de Psychologie, which publishes some papers 
in German or in French that are of interest to industrial psychologists, and 
to Mensch und Arbeit, which is published in Vienna. Articles in these journals 
are almost without exception qualitative in nature, but if the attention of 
British and American industrial psychologists were redirected towards a 
search for fresh hypotheses, it would be both smug and foolish not to acquaint 
themselves with the careful thinking of their Continental colleagues. A note 
by Froehlich & Davis (38), based on ‘‘visits to Germany in 1948, 1951, and 
1952, an incomplete review of current German literature, and personal con- 
sultation with German nationals visiting America” accords with the informa- 
tion reaching this reviewer through similar means. These authors are mainly 
concerned with vocational guidance and in this connection stress the con- 
tinued lack of standardised tests and the use of subjective evaluations pre- 
pared by the psychologist for the vocational advisor. 

There are without doubt marked differences in training, outlook, and 
practice between the majority of Continental psychologists and those active 
elsewhere. Much has been done to further the common cause, but a great 
need remains for adequate provision of opportunities for personal encounter 
over periods measurable in weeks or months rather than the hectic few 
days of an infrequent congress. The same is also true, though to a lesser 
extent, of those active in research in Britain and in North America. 
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GENERAL TEXTBOOKS 


Among the new and revised textbooks appearing during the year, two 
appear worthy of special mention. A half-length version of his Applied Psy- 
chology has been published by Burtt (23), who in spite of readable and fre- 
quently “popular” writing, still manages to underpin most of his material 
with the necessary facts. His statistical appendix for beginners serves, how- 
ever, to remind this reviewer that it really is about time that more psycholo- 
gists were introduced to Kendall’s Rank Correlation Methods (68) and the 
virtues of “‘tau”’; “rho” has had a very long inning without competition. 
Readings in Experimental Industrial Psychology, Blum’s (15) valuable collec- 
tion of papers already well known to the active research worker, but not 
known nearly well enough to the students, the novices, and many others 
whom it is harder to excuse, may do something to raise the standards of 
research design and reporting to a new and higher level. 

The writer may perhaps be forgiven for expressing his view that we could 
well do without any further textbooks of ‘Industrial Psychology” until a 
really new one becomes possible. 


Jos EVALUATION AND WorRK MEASUREMENT 


Ninth in the series of Purdue ‘‘Studies of Job Evaluation”’ (the eighth 
appeared in 1949) is one by Miles (82) on the development of a check list for 
office jobs, originally designed by Culbertson and Dudek and described in 
unpublished theses. For some reason or other this had to be read with in- 
tense concentration several times before it became clear just what was being 
compared with what. If others experience similar difficulties it will be a 
pity, because the design appears sound and the outcome encouraging. Per- 
haps the difficulty lies in the old phenomenon of overfamiliarity with the 
data on the part of the article-writer, which reviewers and abstracters en- 
counter all too frequently in spite of the ceaseless work of editors. Howard & 
Schutz (61) use factor analysis to continue the pastime of exposing the essen- 
tial unidimensionality of job evaluation schemes, accounting for 99 per 
cent of the variance among 11 ‘‘attributes’’ by means of a single factor which 
they identify as ‘‘skill demands.’”’ One may express the hope that the time 
has now come to call a halt; there are other more important exercises for 
Ph.D. candidates to do. Davis (29) draws fresh attention to the psychologi- 
cal consequences of leaving the design of ‘payment systems” entirely in the 
hands of production engineers and work-study experts. Her investigation, 
carried out in five modern factories, covered 386 men and 402 women by 
means of individual interviews. Two-thirds were dissatisfied in varying de- 
grees with the group bonus systems in force, and in no group were more than 
half the workers prepared to describe themselves as satisfied. She concludes 
that the resulting conflicts are basically emotional rather than intellectual 
and recommends that general policy should be framed jointly by manage- 
ment, personnel officers, and medical officers, regarding the work-study engi- 
neers as technical experts who would be asked to “implement the policy.”’ 
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While on the subject of work measurement, three papers may be men- 
tioned from Smith and various collaborators (93, 108, 115) describing further 
work with the ‘‘Universal Motion Analyser,” an electronic device for meas- 
uring travelling time and manipulative time separately. They report (a) 
that ‘‘pull’” movements are significantly slower than “turn”? movements; 
(b) that contrary to motion-study statements, extending distance of travel 
does also increase manipulative time; (c) learning affects the manipulative 
component three times as much as the travel one; (d) that in the type of 
motions analysed by them it is not true that increased pattern complexity 
reduces movement efficiency; and (e) that some positive transfer effect can 
be observed on the manipulative component, but not on travel. Summarised, 
these data strongly suggest need for research into the manipulative compo- 
nents of work behaviour with less attention than heretofore on the travel 
aspect except to keep it short. It will be interesting to observe the printed 
reaction of the motion-study pundits to what will undoubtedly be described 
as an artificial laboratory set-up from which no generalisations into real 
life can validly be made. Psychologists interested in this field will do well 
to consider how such blanket criticisms can be met by careful evaluation of 
the motivational elements in such studies. We do not know as yet, for 
example, whether differences in such elements would operate equally on 
both travel and manipulative components of motion. If it could be shown 
that they do, the work now reported would rest on firmer foundations. 

Industrial psychologists will probably hail with relief the important book 
by Abruzzi, Work Measurement (1), in which he describes the application of 
statistics of quality control to both time and motion study. His research 
goal was ‘‘to develop estimating procedures that could be verified in terms 
of the modern theory of experimental inference.’ His results provide the 
long-awaited demonstration that standard data procedures are largely 
unvalidated, and that as psychologists have long suggested, there is no ‘‘one 
best way”; he also places on the shoulders of the scientific work-study 
experts a heavy responsibility to abolish ratings of worker performance, and 
to substitute objective procedures such as those he describes. Abruzzi’s 
data showing that ‘‘independent standard elements’’ in time study and ‘‘in- 
dependent motions” in motion analysis are frequently correlated, are strongly 
reminiscent of equally untenable claims for the independence of trait ratings 
which can also be shown as intercorrelated. In all these fields the inde- 
pendence has too long rested on intuitive judgments. 


VOCATIONAL GUIDANCE 


Those who naively imagine that vocational guidance is either a highly 
developed precision instrument or a rather hit-and-miss affair will do well to 
read Stott’s honest presentation of the problem of validation in this field 
(103). Like the interview itself, the case is not proven; unlike the interview, 
validation problems are intrinsically resistant to solution. Paradoxically, 
it is from vocational advisors that most help should be forthcoming if we 
are to see the improvement of the interview by alternate modification and 








208 HERON 


re-evaluation through rigorous statistical treatment. Jahoda (64), in a val- 
uable interview study of school leavers, their job attitudes and choice of first 
job, found that present youth employment services come into action too 
late because the informal influence of parents, relatives, and friends has 
already been effective in the majority of cases. This study is distinguished 
by the way in which it combines thorough field work with awareness of the 
need for statistical checks on the data obtained. In amplifying a ‘“‘theory 
of occupational choice—a largely irreversible process involving compromise 
as an essential aspect,’’ Ginzberg (46) draws attention to a pressing need 
for more and better research into the relation of emotional factors to work, 
especially in distinguishing the ‘‘work-oriented from the pleasure-oriented 
person.’’ It may be commented here that although we are familiar with both 
these kinds of people (though usually by other names), we are still largely in 
a state of ignorance concerning the way in which such personalities are 
formed or the extent to which they are modifiable. Although it may prove 
even more difficult to carry out suitable research than to formulate it, this 
nevertheless appears to be one of the points at which psychologists in 
various fields could, and should, get together. 

Strong (104) continues publication of long term follow-up data on his 
Vocational Interest Blank with material about engineer interests. The note- 
worthy point in this 19 years’ follow-up is his statement that assumed 
equivalence of ‘‘A-ratings’’ can be dangerous and that it is the occupation 
with the highest score which is most likely to count. It may be hoped that 
this will not have the effect of stimulating unfounded claims to precise score 
values, especially as it is clear that Strong supports nothing of the kind. 
The assessment of vocational interests by questionnaire methods has been 
extended by the monograph of Strong & Tucker (105). Working under con- 
tract for the United States Army Medical Corps, and assisted by a team of 
five graduate students, they developed blanks for the medical specialties 
of internal medicine, surgery, pathology, and psychiatry, as well as taking 
the opportunity to revise the ‘‘physicians-in-general” scale. This mono- 
graph is especially recommended to the attention of those who are dubious 
about this sort of approach, because the facts necessary for evaluation are 
all there, either in the text or in the six technical appendices. This reviewer’s 
impression is that Strong has strengthened his case. 

Garry (40) showed that Strong Vocational Interest scores can be con- 
sistently faked on request, and that in the student population this ability 
was uncorrelated with intelligence, sex, or information about the occupation. 
As this is in line with Strong’s own statement and that of others, and with 
findings on several personality inventories, we now need a different kind of 
investigation showing in a variety of motive situations how much faking 
is actually done. A possible model for such research is Green’s 1951 study 
of patrolmen (49), in which he used the same personality schedules and in- 
ventories in circumstances (a) when the situation was likely to motivate 
responses in the direction of faking and (5) in another real-life situation in 
which similar respondents knew that nothing depended on their answers. 
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Zuckerman (126) seems justified in claiming the results to be clear-cut in an 
article reporting his comparison of like/indifferent/dislike response arrange- 
ment with “forced-choice” layout in vocational interests blanks. No signifi- 
cant difference was found between the two arrangements, as assessed by 
discriminatory power and split-half reliabilities, but he is right to point 
out that the results obtained may be peculiar to the group of professional 
persons studied. 

Gustad (50) contributes to current work on the Occupational Interests 
Level Scale with a paper in which by clearly presented statistical data he 
shows that sampling problems are as crucial in this connection as in others 
and that college populations have their limitations even if only through 
restriction of range. The pity is that such discoveries are so frequently made 
after the completion of research projects. The same scale was investigated 
also by Barnett et al. (7) in separate studies synthesised into a monograph 
under the direction of Super. Of these the most interesting is perhaps Bar- 
nett’s pioneer study of vagrants in which he suggests that the scale might 
be used as an ‘‘index of the occupational and/or social level which the indi- 
vidual is likely to find congenial.’’ Worthy of comment is the way in which 
Super has here made real the concept of a team approach in doctoral research 
administration; may we see more of it. Not to be overlooked is a suggestion 
by Steinberg (102) that the Kuder Preference Record may be useful in voca- 
tional guidance as a means of detecting neurotic tendencies. His nonneurotic 
group was as impure as usual in such studies, but he still found differences 
significant at the 1 per cent level, justifying a suggestion that further work 
along these lines might be profitable. 


SELECTION AND PLACEMENT 


Application forms.—In a chapter on the applications of psychology in 
the Defence departments, appearing in Current Trends in British Psychology, 
Wilson (121) quotes an interesting illustration of the validity of the Royal 
Navy’s biographical data form which it takes 15 min. to administer. An ex- 
perienced personnel selection officer used the form alone to grade the suitabil- 
ity of 500 men for a training course, and obtained a validity of .554 with 
“success in the course”’ (criterion not described). The standard test battery, 
with which her grading correlated .312, had a validity of only .393. In con- 
cluding the summary in which this example is given, Wilson makes the im- 
portant point that although the actual work of the Armed Forces psycholo- 
gists appears to be mainly technological research, the solutions they find de- 
pend always upon quite general developments in psychological or statistical 
work. This reference to the need for unity in psychology is strongly reminis- 
cent of one of the last papers of Bingham (14), who died during the year. 
Further reference to this topic will be found in the concluding section of this 
chapter. A useful contribution to the problem of accuracy with application 
blanks comes from Mosel & Cozan (84) who commendably used evaluative 
procedures identical with those of Keating, Paterson & Stone (67) in order to 
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permit comparison of findings. The principal result draws attention to the 
direction of distortion rather than to its amount, which is small. 

Interviewing.—The shortcomings of psychometric prediction are so 
thoroughly publicized by those who use such methods, that it is hardly 
surprising to find repeated claims for the so-called ‘“‘clinical approach”’ as 
an alternative. Harrison & Jackson (52) consider that ‘“‘the effect of applying 
these (clinical) methods can be objectively and empirically validated.” 
Readers are invited to consider the resulting paper, without bias if possible, 
and to draw their own conclusions. This reviewer’s concept of validation 
differs from that of Harrison & Jackson but more broadminded critics may 
be satisfied. Kephart’s The Employment Interview in Industry (69), a useful 
book for personnel managers, is inaccurately titled. ‘‘Personnel Selection in 
Industry” for the title might have implied too much but on the whole 
would have been nearer the mark. Very much more than interviewing is 
described in order to put the actual interview as such into perspective, and 
in fact the treatment of the interview itself is on the slight side. Industrial 
psychologists interested in the interview and who have not yet studied the 
relevant chapters in Vernon & Parry’s book (111) are, in the light of Kep- 
hart’s treatment of the subject, strongly urged to do so. Rodger (90) makes 
a spirited defence of the interview on the principal grounds that because 
people will in any case go on using interviews the task of the psychologist 
is to remove the defects. While supporting the use of objective tests and 
the search for better ones, he considers the case against the interview as 
not proven. With this balanced judgment the present reviewer concurs; 
but what a job it is getting interview users to advance the cause of scientific 
rigour! 

Tests—We may open this section with a reference to Brown & Ghiselli 
(19), who continue in their role of avuncular advisors by offering a reminder 
of the dangers implicit in using valid predictors of training success as predic- 
tors of job proficiency or vice versa. In another paper (20) they provide 
fresh data obtained from taxicab drivers concerning the relation between 
job tenure or labour turnover and various tests used as predictors. In general 
these data support previous findings of U-shaped relationships between 
tests and turnover, but the exceptions seem enough to preclude any firm 
pronouncement. In this reviewer’s experience the only safe procedure is 
actually to examine the shape of the regression in every instance. An insig- 
nificant correlation does not always mean no predictive value, and critical 
cutoff points may exist in any particular instance. 

Ralph & Taylor (87) used the General Aptitude Test Battery twice on 
a class of 49 medical students, using grade-point averages after five quarters’ 
training as first criterion, and those after the full four years’ course as second 
criterion. In spite of preselection and consequent restriction of range the 
verbal-numerical-spatial combination gave multiple R’s of .56 and .60 on the 
two occasions. This finding is unusual in medical student selection, suggesting 
deficiencies both in the original selection procedures and in the make-up of 
the criteria, neither of which is the fault of these investigators. They are 
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undoubtedly justified in asking for further research and especially for 
“thorough evaluation by means of well-designed studies’”’ before any new 
procedures are widely installed. In this connection considerable interest will 
be focussed on the 10 years’ study under way in England under the auspices 
of the Nuffield Trust. 

In a brief report Wesman (118) gives some five years’ follow-up data 
about the Differential Aptitude Tests. Among the points stressed by him 
it is as well to mention (a) the specificity of validity, (b) the evidence ob- 
tained for the depression of validities as a result of imperfect criteria such as 
course grades, and (c) the consistently greater predictability of grades for 
girls than for boys. Levine & Tupes (76) confirm the continuing validity of 
the United States Air Force pilot stanine, quoting biserial coefficients all 
around .55 to .60, and provide data to suggest that it would not be difficult 
to raise it still higher by substituting certain tests now available and by re- 
keying the scoring of existing tests. 

Chriswell’s report (26) of a structural dexterity test is less valuable than 
it might have been simply because he apparently rushed into print before 
completing the evaluation. It is not good enough to say ‘“‘more significant 
results might be obtained with age held constant’’; that is the reporter’s 
job, not the reader’s. Similarly it is not satisfactory to report two selected 
multiple R’s out of a possible number of at least five without stating either 
the basis of selection or any of the usual information required by the reader 
in order to evaluate the quoted figures. In spite of these defects of reporting, 
the test itself looks promising enough to justify further trial under research 
conditions. Wyndham (123), describing selection of machine shop operators 
using drills, taps, and hand presses, reports a multiple R of .50, using the 
Minnesota Rate of Manipulation Test (both parts) plus an ad hoc ‘‘de- 
burring test,’’ with bonus earnings as criterion. Reliability of the ad hoc 
test is estimated as averaging about .8 using the intercorrelations of half- 
tests over four trials. The sample was 40 experienced operators and no 
cross validation is yet available. Goguelin (47) gives further details of his 
work on the selection of Paris streetcar drivers using Lahy’s battery of tests. 
This paper provides a Thurstone factor analysis by the complete centroid 
method leading to four interpreted factors, based on the intercorrelations 
of 33 tests with age held constant. The four factors are described as ‘‘mental 
adaptation,” “concrete intelligence,” ‘physiological adaptation,’’ and 
“duration of reaction.’’ The sample consisted of 284 drivers. A second sample 
of 450 was tested with nine of the tests in the first battery and two rotated 
factors corresponded in part with two of the factors from the first analysis. 
He also supplies a table of test/retest coefficients over periods between six 
months and one year in respect of portions of his original sample of subjects; 
N’s are either 40 or 26. The coefficients range from —.31 to +.83, which 
would seem to raise some problems for the users of the tests in selection 
work. Nevertheless it is good to see such data analysed exhaustively in this 
way instead of being accepted uncritically on an impressionistic basis as is 
so commonly done. Germaine, Browne & Bellows (42) ask some awkward 
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questions about physical profiling systems at the end of their useful summary 
of those developed by the Armed Forces of Great Britain, Canada, and the 
United States. In particular they state themselves as “‘unable to find any 
reports of studies regarding reliability and validity of these systems.” 

The literature on that aspect of prediction work misnamed “‘personality”’ 
continues to be scanty. Krathwohl (73), in an attempt to assess the speci- 
ficity of “‘industriousness” by means of the differences between achievement 
and aptitude scores, concludes that he has extended the findings of Hart- 
shorne, May & Shuttleworth (53) by showing that an individual should not 
be considered industrious as such but rather as industrious in particular 
activities. Those wishing to continue trials of personality inventories in 
employee selection had better read Ghiselli & Barthol’s cautionary two-page 
note (44). It might be added that the predictive value of some of the low but 
significant validities quoted should always be sized-up against costs in time 
and personnel for administration and scoring. Objectively scored tests of 
emotional stability of the type described by Eysenck (34) have been used by 
Heron in two studies reported during the year. In one of these (59), forming 
part of an intensive medical study of the incidence of rheumatism among 
British colliery workers, it was shown that as one proceeds from colliery 
office, through surface maintenance men, and those spending part-time 
above ground and part-time below, to the full-time face-workers, instability 
as measured by the test battery increases. In the other (57), where the object 
was to ascertain the extent to which psychological handicap is reflected in 
occupational maladjustment of unskilled workers, significant correlations 
were obtained with highly reliable supervisory ratings. These studies need 
further confirmation with other groups of unskilled workers before the pre- 
dictive value of the tests employed can be established but may be said to 
represent a breakaway from interviews and multidimensional inventories. 
Heller (55), in a study of female hosiery workers, used a simplified form of 
the actual linking operation to predict success among new trainees by means 
of level-of-aspiration scoring. Pending replication, the value of the report 
lies at present in the attempt to extend level of aspiration studies to stand- 
ardised tests of a work sample type. 

Group selection methods.—Four further papers by Bass and his collabora- 
tors (8, 9, 10, 122) continue his examination of the leaderless group discussion 
technique. Data obtained from university students and Army and Air Force 
cadets include validities (using criterion ratings by peers or superiors) rang- 
ing from .17 to .42 and reliabilities from .80 to .88. Pooled discussions which 
are quoted in one of the papers show that, using the same data, validities are 
somewhat higher. It is suggested that low validities may be in part a result 
of restriction in range of criterion ratings dependent upon combination of 
candidates available in a given study, and narrow range of discriminations 
by discussion observers. Scores for leadership group discussion were found 
to correlate .37 with ascendancy and .22 with sociability on the Guilford- 
Zimmerman Temperament Survey, using a population of college women. 
Using a group of strangers instead of people acquainted with one another 
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they obtained an average reliability of .92 and a weighted average correlation 
with future leadership status eight months later of .47. Semeonoff (97) draws 
attention to the possibility that L.G.D. reliability measurement could be 
tackled somewhat differently by means of separate measures of the number 
of times each particular participator was ‘‘active” (defined as speaking) and 
“‘passive’’ (was spoken to) respectively. He hypothesises that if two active 
(A) counts and two passive (P) counts intercorrelate highly within each 
category, reliability is demonstrated. He contrasts this with his own (un- 
published) observations that A/P correlations were often greater than A/A 
or P/P ones, suggesting lack of consistent leadership behaviour. Evaluation 
of the group selection procedure is somewhat furthered by Higham (60) in 
respect of 50 sales representatives chosen for the chocolate and cocoa firm 
involved. Using as criterion a five-point grading on general efficiency by the 
sales manager, 74 per cent were regarded as successful, 18 per cent left volun- 
tarily although satisfactory, and 8 per cent were asked to leave. It is note- 
worthy that the chosen 50 were apparently selected largely on the group 
procedure as no differences could be shown between them and 200 rejects, on 
test scores, schooling, interests, or careers in the Armed Services. 

When these data are added to the summaries already provided in recent 
reviews, it seems that this technique has possibilities in certain circumstances 
and it is greatly to be hoped that in spite of the difficulties of validation those 
using it in varying situations will feel a high sense of responsibility in order 
to ensure a continuing flow of reliable information. We are as yet without 
important data concerning the limitations upon the use of the technique 
which may arise in connection with the variables of intelligence, education, 
or social class. 


CRITERIA 


In a thoughtful, provocative, and realistic address to his French col- 
leagues, Bonnardel says, ‘‘The examination of criteria of success is some- 
thing which psychologists must carry out systematically before all else’’ (16). 
He would therefore share the reviewer’s pleasure at finding a considerable 
increase in the number of useful studies in this field. Severin (98) exposes 
some of the gaps in our knowledge concerning the ability of tests and ratings 
to predict job performance, stressing in the process the danger of substitut- 
ing one criterion for another without prior determination of their equiva- 
lence, and adds his support to the current caveats about substituting training 
records for job performance criteria. Fruchter (39) provides an interesting 
suggestive illustration of the Guilford-Gulliksen thesis that criteria as well as 
predictors should be factor-analysed. If someone with adequate research 
facilities will now do what Fruchter would like to have done, they will have 
rendered a considerable service. A plea may be entered here that such work 
should be carried out somewhere else than in the Armed Services or a uni- 
versity. Hemphill & Sechrest (56) provide a depressingly interesting exam- 
ple of the possible lack of meaning in highly self-consistent criteria based on 
ratings. This is a study of criteria of air crew effectiveness where both su- 
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periors’ ratings and sociometric indexes correlated significantly with bombing 
accuracy (a criterion of near zero reliability) because both the former had 
presumably used knowledge of bombing activity reports as part-bases for 
their judgments. In a study of Army noncommissioned officers Klieger & 
Mosel (72) provide data which suggest that the reliability of ratings is af- 
fected not only by the degree of ratee acquaintance, but also by the manner 
in which experience with the ratee is perceptually organised through status 
and role relationships. 

The considerable literature on sales success through the years is not par- 
ticularly encouraging reading, but a paper by Rush (94) may suggest why 
this is so. He used factor analysis to demonstrate that criteria of sales suc- 
cess were complex, and multiple correlation to suggest that differential pre- 
diction of various subcriteria was both feasible and desirable. The construc- 
tion and use of a report form suitable for clerical officers in a large bank is 
reported by O’Neil (86). The special feature of this attempt is the apparently 
successful outcome of a determination to revise descriptive phrases in each of 
the eight five-item ratings until an approximately normal distribution was 
achieved instead of the initial (and usual) bunching and negative skewing. 
Factor analysis, however, indicated that the final outcome is probably a 
highly reliable rating of a single overall trait of efficiency. A complete ab- 
sence of reference to the work of others mars this businesslike article. 
Weschler, Massarik & Tannenbaum (117) found clearly significant differ- 
ences in the United States Civil Service Commission efficiency ratings when 
experimental conditions were introduced, by far the greater number of shifts 
being in the downward, i.e., less favourable, direction. Does this do more 
than confirm yet again the fact that the error of leniency is nearly always 
with us? If it does no more, then perhaps at least one or two additional 
people will be stimulated into a re-examination of their merit-rating pro- 
cedures. 

Acquaintance or buddy ratings were used by Husén (62) in a study for the 
Swedish Army as an additional criterion for validating psychological test 
batteries designed to select officer cadets. The method of paired comparisons 
was used within units of 20 students, involving for three variables 11,400 
comparisons per unit. High or fairly high correlations were found, that be- 
tween total rank order (TRO) by fellow students on all three variables 
(military fitness, intelligence, and capacity for leadership) and commanding 
officer’s rank order (CORO) being .81; that between TRO and self rank 
order (SRO), .64; and between CORO and SRO, .59. Husén raised the prob- 
lem of criterion selection and decided in this case that the acquaintance 
ratings can be used for establishing the relevance of a test to military achieve- 
ment but not as a criterion of cadet school achievement. For research pur- 
poses Heron (58) established two criteria of occupational adjustment among 
unskilled workers. One in effect validated a factory work-study index by 
comparing equated production over various types of work with actual counts 
of units produced of a single specified type. The other, a criterion of job 
adjustment defined as the extent to which a man was a source of concern to 
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his supervisor, was found to have a retest reliability over a five weeks’ inter- 
val of .78, and also shown by factor analysis and analysis of variance to be 
unidimensional for six raters. The technique, which involved a double sorting 
procedure leading to a forced normal distribution on a five-point descriptive 
scale, has proved serviceable in other situations. 

Stagner, Flebbe & Wood (101) introduced a paper with the statement 
that “the number of studies dealing with job satisfaction has become so 
large that the newcomer to the field may well be appalled.”” One can only 
hope that the newcomer will be more appalled by the low correlation between 
quality and quantity than by the absolute number of studies. Stagner et al. 
find union-management relations, grievance handling, supervision, and work- 
ing conditions ranking 1 to 4 in importance to their sample of 100 railroad 
workers, using as criterion the magnitude of the difference between per- 
centage of positive answers returned by the 50 high and 50 low satisfaction 
groups respectively. The authors call for more study of satisfaction at the 
on-the-job level: that is, personal relationships, rewards and punishments 
directly connected with the day’s work. Weitz (116) puts forward the thesis 
that a worker’s stated sources of job satisfaction are more meaningful if we 
can get some idea of how generally dissatisfied he is with everyday life, and 
illustrates it with some preliminary results. It occurs to the reviewer that the 
common element here may prove to be the extent of instability or maladjust- 
ment in the personality of the individual worker. In a brief summary of 
validation data on his Tear Ballot, Kerr (70) provides a small but useful 
bibliography of job satisfaction studies including some as yet unpublished. 
One of these, by Hoover, is quoted as showing the correlation between mean 
job satisfaction of each of 26 work groups at the beginning and end of a 
three years’ period to be statistically insignificant. Without the full data this 
is difficult to evaluate, but it is to be hoped that the study will eventually be 
published since it may raise issues of importance both to research workers 
and to practitioners. 

Robinson & Hoppock (89) continue the useful series of summaries of job 
satisfaction researches, providing a 23-item bibliography. They note a trend 
‘in the direction of a probing for individual adjustment and group adjust- 


ment causes,’’ commenting that ‘“‘many researchers... appear now to ac- 
cept the thesis that such factors as job security, wages, physical working 
conditions, etc., are symptoms... not causes in themselves.’’ Probably the 


most striking result of an as yet unpublished study by Kristy (74) is “‘the 
clear determination that there is almost no relationship between proficiency 
and satisfaction among (British) post office counter clerks.”” By means of a 
group factor analysis of the combined matrix of proficiency and satisfaction 
variables, he finds that proficiency divides into (a) general ability, and (6) 
reliability defined as ‘‘getting along effectively without having to bother the 
supervisor”; and that satisfaction divides into (a) pleasant feeling towards 
the job and (b) satisfaction with the physical demands in relation to the re- 
wards. Support for the latter dichotomy appears in some recently completed 
and as yet unpublished results obtained by the reviewer in a study of autobus 
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conductors. Bernberg (13) found no significant relationship between assess- 
ments of morale obtained by questionnaires from 890 workers in an aircraft 
factory, and indicators of their efficiency. A supervisors’ merit rating ac- 
counted for 42 per cent of the variance of the efficiency indicators, and the 
four measures of morale intercorrelated highly. It is therefore difficult to see 
why Bernberg used a combined score for efficiency but the four separate 
scores for morale. The former is described as the sum of the scores of the 
specific indicators. Apart from the fact that this does not indicate whether 
standard or other transformed scores were summed, it raises the basic ques- 
tion whether all the distributions involved were suitable for combining. 
Merit ratings, for example, may be well distributed and subject only to mild 
skewness in favourable circumstances with well-constructed scales, while 
frequencies of absence and lateness are notoriously J-shaped, especially in a 
short period such as 20 weeks. Papers of this kind lose a great deal of their 
potential interest and value to those reading them unless authors are forth- 
coming on points such as those raised. 


MISCELLANEOUS TOPICS 


Rehabilitation —Maule (81) used some of the governmental and private- 
enterprise industrial rehabilitation services of Great Britain as a medium for 
trying out the project method of teaching among a mixed postgraduate 
group of psychologists and medicals. Apart from comments on the usefulness 
of the project, Maule makes three points about the current trends in re- 
habilitation which are of interest. First, possession of the same specific dis- 
ability is an unsound basis for classification; second, physical handicap is 
often overemphasised at the expense of occupational abilities; third, rehabili- 
tators are apparently rather antiscientific so far as validating their work is 
concerned. 

Ageing.—Problems of ageing received attention from Tuckman & Lorge 
(109) and from Kirchner, Lindbom & Paterson (71), who tackled the ques- 
tion of attitudes towards employment of older people, the former authors 
among graduate students and the latter among laundry rank-and-file workers 
and supervisors. Graduate students are reported as accepting erroneous ideas 
about older people more than would be expected of such a selected popula- 
tion, while in the laundry (and in a factory used for cross-validation) the 
neutrality of the under-30’s among rank and file contrasted with the increas- 
ing favourableness in older age. This is not as platitudinous as might appear 
at first glance, the attitude of the under-30’s being the critical element and 
the one to which further attention should be given. 

Work environment.—Attention may be directed to three papers on per- 
ceptual aspects of the work environment. Spragg & Rock (100) contribute 
two careful studies of brightness levels in dial readings, showing that such 
levels should be kept above the critical range 0.02 to 0.05 foot-lamberts but 
that further increase adds nothing to error reduction and little to time reduc- 
tion. Use of red lighting requires these limits to be exceeded, but not beyond 
0.1 foot-lamberts. The first paper recognises limiting conditions not provided 
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for in this experimental design and requiring further research. Zaccaria & 
Bitterman (124) used critical fusion frequency as a measure of fatigue when 
investigating the effects of fluorescent lighting on both A.C. and D.C. sup- 
plies. They concluded, apparently with justification, that single-lamp or in- 
phase multiple-lamp fluorescent installations are undesirable from the point 
of view of objective fatigue. No effects on speed of reading were observed. In 
an interesting paper describing the development of a new indicator of ma- 
chine tool travel, Gibbs (45) stresses the way in which existing methods of 
display often dominate industrial tasks, and in the reviewer’s judgment 
rightly emphasises that producing a practical working model is likely to 
achieve change much more quickly than verbal recommendation. 

Accidents and absence.—Maguire, Pearson & Wynn (78) give examples of 
the use of various statistical tests in studying the homogeneity or hetero- 
geneity of the time intervals in industrial accident data. They draw the 
analogy between quality control and accident control statistics but rightly 
stress the important differences between the two fields of study. That old 
veteran ‘“‘accident-proneness”’ continues to receive attention, Webb & Jones 
(114) reporting that the index of reliability (demonstrating proneness) can 
be obtained equally well by the Poisson-fit and by the correlational ap- 
proaches (incidentally still of the order of .2 on fresh data!), while Adelstein 
(2) continues the 1951 South African onslaught of Arbous & Kerrich (5) ina 
long paper followed by contributions from such well-known figures as Irwin 
and Chambers. The last two papers (Adelstein; Arbous & Kerrich) are now 
obligatory reading for all future investigators. Frank (37) gives details of the 
use of a new but unpublished technique developed by Arbous and Sichel for 
the analysis of absence data. Briefly the method is to establish ‘‘absence- 
proneness”’ in a given group of workers by correlating absences in a given 
year with those in the next; if this correlation is significant, to use the bivari- 
ate distribution for predicting future absence by means of a mathematical 
model based on the negative binomial distribution. Frank’s data are en- 
couraging for the use of this approach, and the paper is likely to be of con- 
siderable interest to those concerned with studies of absence. 


SUPERVISION 


Evaluating procedures—A promising dimensional approach to the objec- 
tive assessment of supervisory behaviour comes from Fleishman (36), and it 
is to be hoped that others will give his Supervisory Behavior Description 
Form further trial, especially in situations where external criteria are avail- 
able as validation. 

File’s ‘‘How Supervise?”’ has attracted attention from several sources. 
Millard (83) follows up the lead given by Slocum, Mosier, Harrell, and 
Sartain concerning the relationship between ‘‘How Supervise?’’ and general 
mental ability. It seems a pity that he used the Adaptability Test as a 
measure of intelligence following Sartain and more so that Carter (24), who 
quotes no references, fails to publish his obtained correlations between ‘‘How 
Supervise?”’ and the Otis test. Wickert (120) gives a correlation with the 
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language-factor score of the California Mental Maturity Test, calling it 
“verbal intelligence.”” When a matter is a live issue it gives cause for regret 
that authors are not especially conscientious in making available every scrap 
of relevant data for the use of others. As it is, all we now know from these 
studies is that ‘‘How Supervise?” is almost certainly correlated significantly 
with word knowledge and educational level, especially if a wide range of 
these is involved, but we do not know whether it is affected by general 
mental ability. Whether it is an intelligence test or not, Part III of it corre- 
lated .65 with a rating of supervisory ability by fellow supervisors for two of 
three small groups of supervisors studied by Carter in a metal plant. This is 
one of several studies noticed this year where authors have apparently been 
unable to resist the temptation of proceeding to the use of multiple R on 
samples ranging from 12 to 20 cases. Incidentally Carter would be hard put 
to it to explain why, for the group of supervisors whose ability was not pre- 
dicted at all by ‘‘How Supervise?” an r of .66 was found with the “‘Artistic” 
section of the Kuder Preference Record! 

Training of supervisors.—The scanty research literature on training re- 
ferred to by previous reviewers receives a welcome addition from Castle (25) 
in a paper reporting the development of two attitude scales and a role playing 
test for use in evaluating supervisor training procedures. Although the results 
of their preliminary use are inconclusive, as a contribution to methodology 
the article merits close attention by those better equipped with facilities for 
development and further research. Principles of Human Relations is the title 
of Maier’s textbook (80) contribution in this field. Although it contains some 
material previously published in journal articles the bulk of the content is 
fresh. For those to whom the expression “human relations’ has become 
anathema, this text is recommended reading if only because this author has 
his feet firmly planted on scientific ground. He gives reasons why the training 
techniques he describes justify serious attention and does not fail to relate 
his case material to his general psychology. Maier also provides evidence that 
he at least has not been blinded by success to the need for further well- 
planned research, suggesting ways in which training situations using role 
playing can be valuable opportunities for a variety of research purposes from 
the influencing of opinion to the effect on decision making of various kinds of 
participants. This book is a welcome contrast to the vague and verbose 
affirmations about human relations which have been the cause of much 
understandable resistance to this development in the field of scientific in- 
dustrial psychology. 

Psychology of Industrial Relations by Lawshe et al. (75) is “directed spe- 
cifically at industrial people in supervisory and managerial ranks.’’ Its com- 
petent and up-to-date nontechnical coverage of employee-centred industrial 
psychology would make it a useful handbook for supervisors and managers 
already convinced by Maier’s programme, but it may be doubted whether 
its succinct summaries of good practice would cut much ice with the uncon- 
verted autocrats. Even if they responded to the businesslike appeal to use 
psychologically right methods in order to make employees happier in order 
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to raise productivity in order that we shall “‘all have more,” it is doubtful 
whether they could wear their sheepskins convincingly enough to achieve the 
objective. A not unreasonable analogy is the attempt to promote happier 
family living by buying Dad the right sort of book at Christmas. His problem 
is to get into the state of mind where he can do what the book says! 


ATTITUDES AND MORALE 


Vigorous criticism of that Hawthorne experiment known as the “Relay 
Assembly Test Room’’ comes from Argyle (6). Those concerned with the 
planning of fresh studies of working groups would be well advised to consult 
this paper whether on doing so they agree with Argyle or not. It is at least 
refreshing to see someone at last calling a halt to the procession of per- 
functory references to the Hawthorne studies and their principal generalisa- 
tions. A good looking paper on group productivity, complete with all the 
statistical data one now feels able to expect, laid out in a neat businesslike 
fashion, and dealing with a topic of current interest, is provided by Darley, 
Gross & Martin (28). The work reported is described as the first in a series of 
studies of group behaviour. There is a criterion of productivity, the depend- 
ent variable; there are 18 independent variables to predict it, a table of rank 
order (rho) correlations; and a choice of five multiple R’s using different 
(selected) combinations of two, three, or four variables. The discussion is 
sophisticated both theoretically and statistically, including a half-column on 
the assumption of linear and additive relations between the predictors and 
the criterion. This reviewer became deeply interested in considering how the 
authors might have avoided the element of subjectivity involved in selecting 
the variables for their reported multiple R’s, and as a result he at first com- 
pletely overlooked the crucial fact, clearly stated by the authors, that this 
monumental structure rested on a frail foundation of 13 cases. Gekoski (41) 
in a related paper also used multiple R, on an N of 21, but largely prevents 
overenthusiasm by twice repeating that the results cannot be more than 
suggestive. These papers draw attention to the methodological problems pre- 
sented to the well-trained psychologist by the implicit difficulties of obtain- 
ing large samples of groups. It is the opinion of others beside this reviewer 
that the right way to face these problems is to exercise restraint in the use of 
statistical procedures which demand for their valid use resources which we 
often cannot obtain. 

The Man on the Assembly Line by Walker & Guest (113) is a cleanly 
written report of a well-planned attitude survey carried out in a new auto- 
mobile plant. It will undoubtedly be read by those who get a copy into their 
hands, and if some research workers and teachers in the field of industrial 
psychology feel a bit lost without all the tables they are used to, the novelty 
may do no harm. This investigation is similar in some respects to several 
carried out in Great Britain during the last 10 years by teams of the Medical 
Research Council, but the principal differences are of some interest. Walker 
& Guest’s interviewers reached all their sample at home on the telephone and 
saw their subjects at home by appointment. In Britain, assembly line work- 
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ers seldom have or aspire to telephones and tend to resent home visits any- 
way. Over 90 per cent of the American sample had received more than 
elementary schooling, half of them having graduated from high school. Even 
allowing for their being atypical of American assembly workers, they would 
still contrast with their British opposite numbers, almost all of whom would 
have finished school at 14 or 15. Such divergencies underline the problem of 
generalisation where all such studies are concerned; but where findings are 
the same in spite of such social and educational differences we shall all the 
more confidently add to our general fund of firm knowledge. 

In a survey of worker attitudes in seven factories of a company operating 
a scheme of establishment for wage earning employees which accorded quasi 
staff status and accompanying security, Denerley (31) repeated the frequent 
finding in attitude surveys that the aims of the scheme were not understood. 
Psychologists seem nearly as bad in communicating to employers the re- 
search findings about poor communications as are the employers themselves 
at communicating. Turning for the moment to the more mechanical aspects 
of the problem of communication, mention should be made of a paper by 
Twedt (110) who lifts evaluation of readership to a more sophisticated but 
still convincing level in his competent study of 34 advertising variables in- 
cluding Flesch readability and abstraction scores. He found that pictorial 
colour and size factors could account for more than half the variance in 
readership scores, that size plus number of colours plus square inches of 
illustration gave an R of .77 with readership, and that the regression equation 
stood up well when applied to six other business magazines. This approach 
seems promising but Twedt realises that it needs to be applied to other types 
of advertising location before any generalisation can be made. If it be estab- 
lished that the variables he names are the important ones, the social psy- 
chologists can get busy on the implications. 

In the second of two studies, Comrey, Pfiffner & Beem (27) provide inde- 
pendent confirmation for the findings by Katz et al. (65, 66) that employees 
in more productive work groups tend to have greater pride in their work 
group. These now well-established facts should be considered in close relation 
to other data concerning the small relationship existing between efficiency 
and job satisfaction. We are evidently still a long way from understanding 
the composition of morale. 


LEADERSHIP AND MANAGEMENT 


Van Zelst (125) and Tobolski & Kerr (106) provide a little more data 
about the Empathy Test, the former reporting correlations about .7 with 
leadership ability among 64 trade union business agents; the latter an 7 of .7 
with new car sales success. One wonders what the average correlation would 
be between the two qualities if they could have been tested within the two 
populations studied! A well-planned study by Jackson (63) extends the Bell 
& French (11) laboratory study of leadership in small groups to a real life 
setting among telephone installation and repair sections. Like them he found 
that leadership skills satisfactory to one group proved consistently so in 
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another, redirecting our attention once again to the personality of the group 
leader and away from the personality needs of those led. Jackson is probably 
right, however, to underline the contrasting fact that his data show attitudes 
about other aspects of the job to be more related to the personal frame of 
reference of the respondent. 

Westerlund (119) gives rather profuse details of a well-designed study 
carried out in the Stockholm long-distance telephone exchange. Co-operation 
of management, unions, and operators was obtained for an experiment in 
which a system of group leadership was substituted for the traditional pat- 
tern of functional supervision. The first-named characterises current British 
and American practice; the latter, still in use in Sweden, involved the opera- 
tor in responsibilities spread over a large number of superiors. Apart from 
the successful practical outcome (management agreed on the basis of the 
results to start a change-over to group leadership) the report may represent 
an important landmark in Scandinavian industrial psychology. The bibliog- 
raphy is comprehensive and remarkably up-to-date, and the statistical meth- 
ods employed in testing a series of hypotheses are appropriate for the data 
involved. It is unfortunate that rather too much tabular material has been 
included, most of which could with profit to the text have been omitted or 
relegated to the appendix. Browne & Neitzel (17) have experimented with 
the Stogdill & Shartle scales for investigating responsibility, authority, and 
delegation of authority, relating results to the Harris ‘‘Morale Scale.’’ Some 
ingenious comparison scores were devised and this article may be said to 
represent a necessary but tentative stage in the process of quantifying rela- 
tionships which have hitherto been largely verbalized. This gives rise to a 
point which it is hoped editors may feel able to note, that until new and 
exploratory techniques are well-known, authors using them should be asked 
to do more than quote four-year-old references in lieu of giving summarised 
descriptions illuminated by examples. 

During the year, the National Institute of Industrial Psychology pub- 
lished the results of a government-sponsored survey of Joint Consultation in 
British Industry (85). While it is undesirable to attempt a summary of this 
work here it should be said that many of its findings will be of interest to 
others outside the United Kingdom. The difficulties of the industrial psy- 
chologist in Great Britain may perhaps be estimated from the fact that of 
the 4719 manufacturing establishments approached by questionnaire only 
751 replied. It was found that the foremen were the least favorable to joint 
consultation as it is generally constituted; only 39 per cent actively sup- 
ported it while 47 per cent accepted it and 14 per cent resented it. In a 
sociological study of joint consultation in three British factories, Scott (96) 
comes by different methods to somewhat similar general conclusions as does 
the National Institute of Industrial Psychology survey. His suggestions for 
further work include ‘‘determining whether there is a particular type of social 
structure within which both high morale and high productivity co-exist . . . 
(and) why the behaviour of different leaders differs.’” On the methodological 
side, Scott supports the group discussion as a research technique but realises 
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the need for further experimentation in methods of quantifying the data it 
provides. Rose (92) lists 23 hypotheses which he suggests might be used as a 
starting point for rigorous investigation of mediation techniques. These hy- 
potheses arose from his first-hand studies of mediation procedures. 

Mace (77) describes the widespread provision in the United States of 
education for management at the university or college level using the Har- 
vard Business School as a leading example. After indicating in a shrewd and 
interesting way the sources of difficulty inherent in the philosophies of 
British university and industrial leaders, he expresses his belief that institu- 
tions such as he describes are a desirable development. In the reviewer's 
opinion Mace is right to emphasize that even in the United States there 
seems need for closer collaboration between the business schools and other 
departments in which psychologists are engaged in research. 

In the report by Scott just referred to he says: 


the operational role of management rather than the mere presence of higher manage- 
ment at meetings .. .is an important point which appears to have been neglected in 
the literature. The paternalistic tradition . . . underlies the behaviour of top manage- 
ment and is responsible for the fact that consultation is not really “joint” ... and 
why they (employee representatives) are preoccupied so exclusively with “welfare.” 


Davis (30) reports an unusual study by independent investigators of the 
merit rating scheme used by a small factory employing 500 workers. The 
supreme importance of the personal values and behaviour of the senior execu- 
tives is described in connection both with the inception and installation of 
the scheme and in attempting toaccount for the health of the working commu- 
nity. Davis suggests in conclusion that ‘‘there are perhaps more gaps in our 
knowledge of why men work hard and contentedly than of why they do not.” 

Those concerned with the problem of administrative communication 
should read at least the final chapter in Ronken & Lawrence, Administering 
Changes (91). The authors, after presenting a case study of the introduction 
of a new manufacturing process, list five tentative generalisations about the 
way in which communication is dependent both upon skill and circumstance. 
They relate those to a central concept, “the need for self-awareness in the 
administrator” and make it clear that they regard this as an attitude which 
can be learned. As this process of learning involves alteration of social per- 
ceiving to a degree which requires a great deal of the individual in the way of 
motivation and security, there is obviously an important task for the psy- 
chologist here, provided he is the whole man rather than the preoccupied 
specialist in some narrow field. 

Another example of the case-study approach is Scott & Lynton’s Three 
Studies in Management (95). The special value of this book may lie in the fact 
that the three studies present such variations on the theme of change that 
the reader is at once impressed by the unlikelihood of a generally applicable 
formula for success. Those especially concerned with the psychological prob- 
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lems of selection and of training which may arise when a factory is built for 
an industry new to an old community will find one of these studies especially 
interesting, while another study shows in a striking manner what can be 
done when management is really permissive and reassuring in its attitude 
toward the training programme. In an appendix the authors are downright 
in their views about the methodological problems of social science. They are 
fully justified in asking whether the attitude questionnaire responses of 
““Yes”’ are necessarily equivalent in significance from Smith to Jones, especi- 
ally when they give clear indications of their awareness that “the degree to 
which detail can be usefully abstracted varies with the purpose for which the 
data are to be evaluated.” But their hesitations about what might be called 
a behavioural view of questionnaire responses extend further: to observations 
by a trained investigator and to the recollections of a local participant source 
of second-hand information. In the two latter there is the real problem of 
significance by selection which they imply is also true of ‘abstraction by 
statistics.’’ Such an implication is by no means necessarily correct. Whether 
it is or not will depend in the event upon the appropriateness of the statistical 
methods to the purpose in hand and on the skill and integrity of the users of 
such methods. This said, let it be added immediately that much of this ap- 
pendix on research method can be read with considerable profit by most in- 
dustrial psychologists. 

The basic problem, that of changing the attitudes of top management, 
referred to explicitly by Maier (80) and largely ignored by Lawshe (75), may 
prove pretty intractable in the present generation. However, it has been 
tackled in advance by the Harvard Business School, which for some years 
has been turning out young executives familiar with these needs and ex- 
perienced in facing them in their own persons. The case-method of teaching, 
of which we have read in articles and reports of addresses, is now set out 
fully in the Case Method of Teaching Human Relations and Administration 
edited by Andrews (4). A careful reading of this book will probably convince 
even the most sceptical that something of profound importance is being done, 
and that those who are doing it are prepared to admit the difficulties frankly. 
There are three sections to this symposium. The first, ‘‘Teaching and Learn- 
ing,’’ makes clear the theory, spirit, use, and consequences of the case method 
at Harvard Business School. The second indicates and illustrates its exten- 
sion into industry. The last, ‘‘Research Problems in Human Relations,”’ 
stakes a claim for the ‘‘clinical method” which is sufficiently well-argued to 
merit serious consideration even from the toughest of tough-minded critics. 
The principal representative of the clinical method in the book is Lombard 
(pp. 225 to 240) who says baldly ‘‘. .. the techniques of the controlled ex- 
periment are not what I wish students of human behavior would learn from 
the natural sciences.’’ Before dismissing him and his colleagues as unscien- 
tific, one would do well to examine the reasoning with which he supports this 
statement. 
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When commencing this review, the writer made no apology for providing 
a reminder of what has gone before, believing as he does that during a forced 
march someone from time to time must accept responsibility for glancing 
over his shoulder and reporting what he sees. An inspection of the con- 
temporary scene and our adaptation to it is at once depressing and encourag- 
ing. We still spend too much time producing new tests instead of improving, 
adapting or re-analysing old ones. Many of us are still lopsided in our atti- 
tude to and use of statistical procedures. We still often fail to use our psycho- 
logical insight in the use of our techniques, the teaching of our students, the 
writing of our books and articles, or the conversion of our employers in uni- 
versity, government, or industry. We write too much, talk too much, and 
think too little. On whichever side of the Atlantic we live, we are too prone 
to ignore or to belittle the products of the other. 

It is encouraging to see in the results of the year under review that at long 
last the criterion is receiving the attention it merits; that training, both of 
supervisors and of executives, is now a major interest; and that studies of 
leadership and group behaviour are beginning to be the objects of more 
scientific enquiry. Catch-phrases such as ‘‘accident-proneness,”’ ‘‘the one best 
way,” and “the intuitive approach” have been vigorously attacked with 
possibly mortal consequences, while human relations has been rescued from 
the bog into which it had been dragged. 

In a paradoxical age of passing fashions and repetitive movements it is 
difficult to note significant trends with any confidence. Perhaps it will be 
more profitable to consider our professional status and aims and to make ex- 
plicit some of the immediate needs. Bingham (14) reminded us that we were 
in serious danger of being torn apart. Psychology is a branch of scientific 
enquiry, a means to selected ends, and a personal skill. There is current 
evidence that industrial psychology lacks integration. Workers in this field 
can too easily be either scientific or practical, rigorous or realistic, academic 
or practising. This suggests that present and pressing needs are of two dis- 
tinct kinds, each more appropriate to some of us than to others but both 
relevant to the work of all. 

One need may be met if those who as yet do not know industry well were 
to face up to some questions posed during the year by Gorsuch (48): (a) Why 
do psychologists have such difficulty in gaining acceptance in industry? (d) 
How do psychologists spend their time in industry? (c) What can we do to 
provide better psychologists for industry? (d) How well are psychologists 
equipped to solve business problems? Are we getting too smug? Do we have 
the necessary tools? If not, what are we doing about it? 

The other need has been outlined by Flanagan (35) who, when wishing 
to advance some suggestions for research on what he calls “important social 
problems,”’ significantly felt it necessary to begin with a summary restate- 
ment of the basic ingredients of scientific study. Going on to develop his 
main theme he said: 
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This does not mean that I prefer applied research to basic research. There seems 
adequate evidence at the present time that work on applied problems can lead to the 
discovery of basic scientific knowledge. The terms in which the problem is formulated 
and the general research methods used are the determining factors in whether a re- 
search study yields a specific result or one which may be generalised. 


In view of the current emphasis on research into the social aspects of in- 
dustrial adaptation, we may perhaps fittingly end by considering a view 
expressed by Drucker (32). He calls for research to be directed at the rela- 
tionship between men, the group, and the actual job they are doing, and 
suggests that overemphasis on the human relations approach has distracted 
attention from the demands made by the job itself on the individual. This 
corrective is timely; we may need to be reminded that, important though 
they are now, at last, seen to be, social pressures are not the only ones. We 
need a balanced approach, the recollection that ‘‘psychology developed its 
applied areas because of public demand that the psychologist contribute his 
knowledge and skill to efforts to solve some of the urgent problems of man in 
a complex society’’ (3). We can agree with Drucker when he insists that in 
addition to more adequate methodology, the most urgent need is for better 
basic concepts; it may well be that as with psychology generally, so with 
industrial psychology in particular, the time has now come when most of us 
must ask those who support our work to permit a pause while we think. 
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COMPARATIVE PSYCHOLOGY'” 


By RoGER W. RUSSELL 
Department of Psychology, University College, London, England 


A chapter on Comparative Psychology differs in certain very significant 
ways from other chapters included in the Annual Review of Psychology. 
Basically it is unique in the sense that the term ‘‘comparative psychology” 
refers to a method of approach rather than an area of problems. It may be 
concerned with problems pertinent to most of the other chapters in the 
Review; its coherence lies in the application of the logic underlying the 
comparative methods of science. Schneirla (1) in considering some con- 
ceptual trends in comparative psychology has made this point a major issue 
by commenting that: ‘‘the practical consideration of paramount importance 
here is not how many types of animals we use in our investigations, but 
whether or not we use the comparative methods of science in working 
with them.”’ 

The logic of the comparative methods is based upon critical comparisons 
of phenomena that are truly homologous rather than merely analogous. The 
term “homology” implies that the phenomena are alike in origin and in 
fundamental structure. Russell (2) in considering the development and use 
of this logic has indicated how widely it has been applied in the study of 
language and communication; of social behavior, culture patterns and 
social institutions; as well as in attacks on problems of interest to the 
biological sciences. Because of its nature this comparative approach of the 
study of behavior is in no way limited to comparisons among individuals or 
groups differing phyletically, although in certain professional groups it has 
come to have such a meaning. Psychologists in some countries still under- 
stand it as covering many of the problems in child, abnormal, and social, 
as well as animal, psychology. Regardless of the basis upon which the groups 
compared have been classified comparative studies have in common a search 
for similarities and differences in the homologous behavior phenomena of a 
variety of living organisms. 

Because comparative psychology is defined by method rather than 
problems peculiar to its investigations it does, to some extent at least, cut 
across the classification scheme upon which other chapters of the Review 
are based. Its findings can, and should, be considered in other chapters. 
Authors of this chapter in previous Reviews have recognized this and have 
set a precedent of limiting their writing mainly to information on problems 
for which the study of infrahuman animal behavior is particularly adapted. 
The present chapter will conform to this precedent. 


1 The material in this review covers approximately the period from April 1, 1952 
to April 1, 1953. 


2 The following abbreviations are used: ACTH for adrencorticotrophic hormone; 
ESC for electroshock convulsion. 
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EXPERIMENTAL PSYCHOGENETICS 


In 1951 Hall (3) brought to the attention of psychologists the potential 
values of research into the influences of genetic factors on behavior. His 
review of the present status of psychogenetics revealed how little systematic 
experimental work has been done in which the orientation of the research 
design has been that of the student of behavior. 

Experimental psychogenetics is a field of research which must depend 
upon the use of infrahuman subjects (4). It is limited by the fact that man, 
with his particular culture patterns and long life span, cannot be employed 
as a subject for research. Experimental attacks on problems in this field are, 
therefore, the real responsibility of the animal psychologist. 

Stob & Andrews (5) have shown how certain constitutional features 
such as body weight, thyroid weight, and histology, and gonad weight and 
histology respond differently in different inbred lines of mice to thiouracil 
and thyroprotein. The effects of ingesting the latter at the level of 0.22 per 
cent of the diet was to increase body weight in some lines, but to decrease it 
significantly in others. The seven lines studied could be divided into three 
size groups and the effects of thiouracil in small amounts (0.05 per cent of 
diet) were to depress growth in the large lines and not in the medium or small 
lines. Significant interactions between body size and treatment, sex and 
treatment, and line and treatment were found with respect to thyroid 
weight. No consistent effects of the drugs on gonad weight or histology were 
noted. There are a number of studies in the literature of psychogenetics to 
show that complex behavior patterns may be influenced genetically, either 
in whole or in part, but none of these studies is more striking than Hauschka’s 
(6) report on mutilation patterns and cannibalism in mice. The incidence 
of parental cannibalism in animals from eight different strains kept on the 
same adequate diet, ranged from 0 to 45 per cent. In the subline where this 
behavior was most frequent it showed little fluctuation during 13 sibmated 
generations. Mothers of this subline restricted themselves to anatomically 
defined mutilation and amputation patterns and often reared their uni- 
formly injured infants. Estes (7) has reported significant correlations be- 
tween the racing class of high-spirited thoroughbred broodmares and off- 
spring and of phlegmatic broodmares and their offspring. Although concerned 
with quite different characteristics of living organisms these three studies 
illustrate the first of four major steps experimental psychogenetics is taking: 
the determination of whether or not specific behavior patterns are influenced 
by genetic factors. 

The second step begins when such influences have been established and 
undertakes to study the number and nature of the genetic factors involved. 
Hauschka (8), who appears to enjoy discovering genetic oddities, has re- 
ported observations of ‘‘whisker-eating’’ mice, whose performance followed 
a precise working pattern. Animals with this behavior anomaly were found 
in several heterozygous genotypes and in two inbred strains, but not in five 
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other pure lines. All animals had been maintained on the same diet. From 
his observations Hauschka has concluded that multiple factors may be in- 
volved in influencing this behavior, single factors being definitely ruled out. 

The third step in experimental psychogenetics is concerned with the 
location of the gene or genes on the chromosome. Falconer (9) has shown 
how linkage between characteristics can aid in such mapping. In the house 
mouse, ‘‘reeler’’ is a recessive gene which impairs locomotion through a 
disturbance of balance. By proper breeding procedures Falconer has shown 
this gene to be linked with two other genes, whose loci, in linkage group iii, 
were already known. 

The old problem of interaction between genetic and environmental 
factors often has given rise to queries which, in the opinions of many, have 
turned out to be pseudoquestions. Recently Falconer (10) has stated the case 
for a real problem of environment and selection. In his opinion most genet- 
icists would argue that performance in a favorable environment has a dif- 
ferent genetic basis than performance in an unfavorable environment. A 
superior genotype in one environment could not be expected to be superior 
in a different enviroiment. It seems probable that the student of psycho- 
genetics will eventually have to face up to interactions of this kind. 


EFFECTS OF EARLY ENVIRONMENTAL CONDITIONS ON 
LATER BEHAVIOR 


Several systems in psychology have emphasized the important effects 
early environmental conditions may have on later behavior, particularly 
the appearance of behavior disorders. Recent experiments have studied 
effects produced by three general classes of conditions: those affecting body 
structure or function, intense stimulation, and ‘‘differential environments.” 

An experiment by Levinson (11), to be described more fully later, 
examined the effects of different amounts of x-irradiation at one of five 
stages of gestation on the maze learning performance of 50-day old rats. In 
terms of the usual measures of maze learning, performance was found to 
decrease as a function of the amount of fetal irradiation given. It was also 
noted that variability in performance increased with treatment. Armitage 
(12) gave varying dosages of the barbiturates, sodium barbital, and sodium 
pentobarbital, to pregnant female rats 19.5 days after breeding and studied 
the performance of the offspring in two mazes and one reasoning situation. 
Comparisons between the performances of drug and control groups indicated 
significant reductions of performance in the former. Two studies have been 
concerned with the effects of early anoxia on later learning performance. 
Becker & Donnell (13) asphyxiated guinea pigs in utero by occluding the 
maternal uterine vessels after one animal in each litter had been delivered 
as a control. One group of animals was released in time sufficient for spon- 
taneous recovery; a second group was asphyxiated until comatose, totally 
apneic, and atonic, and then artificially revived with gas mixtures. Learning 
performance was tested 8 to 10 weeks later in problem boxes of two levels 
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of difficulty. Analysis of the learning data for both levels of difficulty re- 
vealed significantly greater error and perseveration scores for the experi- 
mental as compared with control animals. Retest two weeks after learning 
showed retention to be affected in a similar manner. Hurder & Sanders (14) 
divided their rats into control and experimental groups at approximately 
one hour after birth. Two experimental groups were then placed in a decom- 
pression chamber, where one remained for 3 hr. at a simulated altitude of 
30,000 feet and the other, for 38 min. at 34,000 feet. At 100 days of age, 
control and experimental groups were trained on a Stone multiple-T alley 
maze. The learning scores of the three groups were not significantly different. 

Griffiths & Stringer (15) reported an experiment on rats designed to 
determine the effects of intense stimulation during infancy on adult be- 
havior. One group of animals served as controls, and four experimental 
groups were exposed, each for a 3 min. period, daily during the first nine 
days of life and twice daily until the fifty-seventh day, to one of the following 
forms of stimulation: intense auditory stimulation, rapid rotation, extremes 
of environmental temperature, and electric shock. Beginning at 60 days of 
age, all groups were tested in four different situations: the Warner-Warden 
maze, a modified Lashley discrimination apparatus, the Hall open-field test 
for emotionality, and a test chamber for audiogenic seizures. No significant 
differences appeared between the performances of the experimental and 
control groups. 

Three reports examine the effects on the later behavior of early experi- 
ence in differential, or varied, environments as compared with restricted 
environments. Although the specific nature of the environments varied in 
the three studies, in all cases they were arranged on the one hand to allow 
wide access to a variety of different features and conditions, and, on the 
other, to restrict movement to a very limited field such as a laboratory cage. 
Bingham & Griffiths (16) reared groups of rats in such contrasting environ- 
ments for 30 days, beginning at 21 days of age. Two weeks later tests began 
in five different situations: the Warner-Warden maze, an inclined-plane 
maze, a modified Lashley jumping apparatus, the Hall open-field tempera- 
ment test, and a test chamber for audiogenic seizures. Measures of the be- 
havior elicited in each of these situations showed that animals reared in the 
free environments were superior to those reared in the restricted environ- 
ments in maze-learning performance only, no significant differences appear- 
ing in the other measures taken. Hymovitch (17) blinded his rats, some in 
early life and others at maturity, and reared them either in a ‘“‘free-environ- 
ment box’’ or in environments allowing various amounts of restricted 
activity. At maturity the animals were tested on the Hebb-Williams closed- 
field apparatus and on an enclosed, 10-unit T maze. Differences between 
early and late blinded animals were not significant nor were the performances 
of the free and restricted groups on the T maze. However, the free groups 
were superior to the restricted in their performance in the closed-field test. 
Two additional groups were studied. In one, free-environment experience 
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was received during early life and restricted-environment experience later 
on; in the other, this order of experiences was reversed. In the adult tests the 
former group was clearly superior to the latter. Forgays & Forgays (18) in 
an experiment of similar design to those just described have obtained similar 
results. Attempts to explain the results of these experiments have been in 
terms of differential opportunities during rearing for perceptual experience 
and learning. 


SENSORY PROCESSES 


During the year covered by this review research continued on traditional 
problems in sensory processes, with an occasional suggestion of a new 
problem or a modification of technique. Fingerman (19) and Fingerman & 
Brown (20, 21) reported the results of three experiments designed to study 
color sensitivity in Drosophila melanogaster. These investigators interpreted 
their evidence as indicating that Drosophila possesses a true color vision. 
The evidence comes in part from observations of the strength of phototaxis 
in response to white and monochromatic lights following differential 
adaptation of the eyes to various light stimuli. Response to colored light was 
lost with reduction of the intensity of the stimulus, suggesting the existence 
of photopic and scotopic mechanisms as in the vertebrate eye. Because the 
genetics of Drosophila are so well established it was possible to select six 
groups of animals differing presumably in eye coloration only. Using photo- 
tactic orientation in response to 17 selected wave lengths between 366 and 
700 my, spectral response curves were plotted for each of the groups. These 
curves were found to differ not only in over-all strength, but also in form, 
and appeared to be dependent upon the differential absorption of light by 
the various eye-pigments. The suggestion that the duplicity theory may ap- 
ply to the compound eye is further supported on the anatomical basis that 
both long- and short-axoned retinula cells have been found in several Arthro- 
poda. A final study in this series was designed to search for a Purkinje shift 
in Drosophila vision. The investigators observed the way in which the 
spectral response curve altered as the intensity of light was decreased. Maxi- 
mal response to monochromatic light of relative intensities of 1 and 1/10 
was in the region of 520 to 540 my and for relative intensities of 1/100 and 
1/1000, in the region of 480 to 500 my, a shift of approximately 40 muy. 
These studies furnish interesting comparative data relating certain processes 
in the compound eye with those in the differently structured vertebrate eye. 

Montgomery & Heinemann (22) have argued that if von Frisch’s findings 
on polarized sky light, as a basis of navigation in bees, is applicable to homing 
birds then the latter should be able to discriminate readily between patterns 
of light polarized in one plane and patterns polarized in a plane orthogonal 
to the first. Using homing pigeons as subjects, they presented the required 
stimulus patterns on a ground glass stimulus key. Responses to polarized 
light in one plane only were rewarded. The birds were maintained at 80 to 
85 per cent of their normal body weights during acquisition and extinction 
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trials. The results gave no evidence of discrimination between the planes of 
polarized light. On the other hand, brightness discrimination was learned 
very rapidly. 

Dice & Barto (23) have reported on the ability of mice (Peromyscus) to 
hear ultrasonic sounds. Two sources were employed, one ranging from 500 
to 16,000 c.p.s. and the other from 10 to 100 ke.p.s.; sound pressures of 1, 
5, 10, or 20 dynes per cm.? at the animal’s ear were used as desired. Activa- 
tion of conditioned responses indicated supraliminal stimulation effective 
over a range from 500 to 95,000 c.p.s. in animals 25 to 70 days old, this 
range tending to decrease with age. 

Another sign that research on animal behavior is keeping up with the 
times has come from an experiment by Alexander & Githler (24) regarding 
the chronic effects of jet engine noise on the structure and function of the 
cochlea. This study was not only of practical value in suggesting that jet 
engine noise presents an occupational hazard, but it contributed to our 
understanding of the function of the cochlear apparatus and of histological 
and electrical methods of examining it. Guinea pigs with normal hearing 
were exposed for a 15 min. period to the noise generated by a jet engine 
having an ambient sound level at cage distance in excess of 140 db. over a 
measurable frequency range of 25 to 7,500 c.p.s. Six to eight weeks later 
the sensitivity of each ear was determined by measuring cochlear potentials 
for 10 frequencies between 200 and 10,000 c.p.s. after which they were 
examined histologically. The group loss in sensitivity was found to be 34 db. 
Damage in the cochlea extended between 6 and 12 mm. from the basal end, 
with more destruction in the base than in the apex, which was interpreted 
as supporting the contention that the total cochlea is involved in response 
to sound. Electrical and histological techniques provided comparable infor- 
mation, the former showing depressed activity in accordance with the dis- 
tribution of damage as determined by the latter. 


EFFECTS OF BRAIN DAMAGE AND X-IRRADIATION 


A number of investigators have reported the effects of experimentally 
induced brain damage or of x-irradiation on various aspects of animal be- 
havior. The latter studies, especially, illustrate the way in which a matter 
of wide concern to society can exert its influence on trends of scientific 
research, and very probably they are destined to become rapidly developing 
areas of investigation. Because of space limitations these topics will be left 
to the chapter on Physiological Psychology. 


AUDIOGENIC SEIZURES 


The early experiments of Maier and his collaborators on abnormal be- 
havior in the rat, recently summarized in his book on frustration (25), had 
the effect of introducing a controversy which has stimulated a large amount 
of research, has ramified far beyond its original frame of reference, and has 
not yet reached generally satisfactory conclusions. The search for conditions 
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critical to the production of the several forms of audiogenic seizures reported 
in the early studies still goes on. 

Michels & Bevan (26), using rats, and Rabe (27), using mice, have 
examined the effects of the size of the test chamber on production of audio- 
genic seizures. In the first of these studies three sizes of test chamber, in 
the ratios of 1:2:4, were used. Groups of animals were given, individually, 
two minute stimulations in these chambers, one on each of 18 alternate 
days. The results revealed no significant differences in the frequencies of 
seizures among the groups, although significant differences did appear in the 
mean latencies of first running attacks in the largest and smailest chambers. 
Rabe’s experiment was based on the hypothesis that the characteristic reac- 
tions of animals in the test chamber serve an ‘‘. . . abreactive function, and 
the cumulative frustration effect was produced by motor restraint.”’ The 
design of Rabe’s study was similar to the one just described, although the 
precision in varying test-chamber size was lacking. Significant differences 
were found between the duration of seizures in the two chambers. The strain 
of mouse studied is characterized by a high fatality rate from convulsions; 
in this regard large differences were observed between animals tested in the 
two chambers, 85 per cent of those confined dying as compared with 25 
per cent deaths of those not confined. Since these two studies were not de- 
signed for purposes of comparison it is impossible to know whether the 
differences in results represent species differences or are functions of other 
conditions imposed by the experimental situation. 

The search for genetic influences on susceptibility to audiogenic seizures 
has almost run the gamut of possibilities from single Mendelian character- 
istics to multiple factors. Now Frings & Frings (28) have reported ‘‘few”’ 
differences in seizural behavior among three strains of mice, except for the 
distributions with respect to age of maximal effect, and even here their 
figures showed great overlap. This may suggest that genetic factors may 
have little influence and is contrary to the more carefully planned genetic 
study by Hall (29). 

A number of studies deal with the effects of experimental alteration of 
internal conditions of the organism on susceptibility to audiogenic seizures. 
Hurder & Sanders (30) have reported an experiment designed to test the 
hypothesis that ACTH? injections, sufficient to increase adrenal cortical 
size, modify the susceptibility of rats to audiogenic seizures. All animals 
were tested for seizure susceptibility and later injected with ACTH. Two 
hours after injections they were again tested. This second test failed to re- 
veal significant changes in the susceptibility of animals susceptible or non- 
susceptible on the initial test. Later assay of the adrenal glands showed the 
glands of susceptible animals to be heavier than those of nonsusceptible 
animals. These differences between effects of injected ACTH and gland size 
are puzzling and suggest that further research along these lines should be 
encouraged. Previous research (31) has suggested that susceptibility to 
audiogenic seizures is related to certain forms of middle ear infection, al- 
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though this relation has been questioned (32). Recently, two studies have 
been reported in which vestibular dysfunction and middle ear disorder have 
been experimentally induced and their effects upon seizures observed. Finger, 
Brice & Day (33), using three groups of rats, injected one group with strepto- 
mycin, another with dihydrostreptomycin, and the third with isotonic 
sodium chloride solution. The large doses of streptomycin used impaired 
vestibular function markedly and auditory function to a much lesser extent. 
The susceptibility to audiogenic seizures of animals so affected showed no 
significant change from that of the two control groups, and the conclusion 
was drawn that vestibular excitation can be eliminated as a critical condition 
for convulsions induced by sound stimulation. 

Marx & Chambers (34) used plastic-acetone injections into the naso- 
pharyngeal openings of the right eustachian tubes of rats to produce middle 
ear disorders. On the basis of pretests these had been classified nonsensitive 
to audiogenic seizures. Controls were given only acetone solution, which 
had negligible effects. Repeated tests failed to produce convulsions under 
either condition, although later autopsy showed heavy fluid or pus in the 
middle ears of all but two of the experimental animals. The results strongly 
suggest that middle ear disorder per se is not a critical condition for suscep- 
tibility to audiogenic seizure. 

Body temperature, another internal condition of the organism which 
might possibly influence susceptibility, has been subjected to experimental 
examination by Fuller & Rappaport (35). The effects on seizure frequency 
of 3 min. immersion in water at different temperatures were observed with 
most striking results. When wet the convulsion-susceptible mice were im- 
mune to convulsions, an immunity which disappeared when the animals 
were dried off rapidly and tested after 2 hr. or allowed to dry in their home 
cages and tested 24 hr. after immersion. The evidence upon which the re- 
searchers based their view that lowered body temperature was the important 
factor in the immunity comes from a comparison of the frequencies of 
seizure among animals immersed at 5°C., 23°C., and 38°C. Although immer- 
sion at all these temperatures reduced the frequency of convulsions, immer- 
sion at 5°C. was more effective than at 23°C., and both provided greater im- 
munity than immersion at 38°C. Whether or not body temperature is the 
critical condition in these effects will need further analysis. 

An experiment by Marx & Van Spanckeren (36) illustrates the way in 
which the behavior of an organism may interact with its susceptibility to 
audiogenic seizures. It was designed to test the hypothesis that reduction in 
seizural behavior will occur if the susceptible animal has learned to control 
the inducing stimulus. Experimental subjects were first trained to turn off 
the stimulus by tilting or touching a wooden pole extending into the test 
chamber, control animals being tested in a chamber containing a pole which 
did not allow control of the stimulus. After a training criterion had been 
reached the experimental animals were advanced to the next stage of the 
research design in which the pole was introduced at various intervals of time 
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following the onset of the whistle. Several of the experimental animals were 
able to make the instrumental responses to the pole at delay intervals of 16 
sec. or longer, and they showed significantly fewer signs of audiogenic 
seizure than did their matched controls. These results generally confirmed 
the hypothesis under test, but their real significance lies in the evidence they 
have provided that audiogenic seizures may be influenced to some extent by 
learned control. 


EFFECTS OF ELECTROSHOCK CONVULSIONS ON BEHAVIOR 


In 1937 Bini (37) reported on the use of electric current to induce grand 
mal type convulsions in dogs, a technique which soon became the most 
frequently employed therapeutic method in the somatic treatment of man’s 
behavior disorders. Since this initial experiment a large amount of research 
has been and continues to be reported in which the effects of ESC? on the 
behavior of several species of animals have been studied. 

Research has centered around the general question of what parameters, 
organismic and environmental, of the ESC situation influence its effects. 
Pierce & Patton (38) designed an experiment to study electroconvulsive 
thresholds as a function of the age and weight of the rats used. They found 
age-weight, threshold-weight, and age-threshold correlations to be high, 
and similar, in magnitude. Further analysis suggested that body weight is 
more effective in influencing convulsive threshold than is age, although 
when body weight is held constant an increase in age is associated with a 
decrease in threshold. Electric shock produces convulsions with three dis- 
tinct phases; they are shock, tonic, and clonic. Townsend, Russell & Patton 
(39) found that the duration of each of these phases is independent of the 
intensity and duration of the current inducing the convulsion, provided the 
current is of sufficient magnitude to reach the convulsive threshold. On the 
other hand, durations of the shock and clonic phases showed a small, but sig- 
nificant, curvilinear relationship with the cumulative effects of a series of 
treatments. Kurahashi (40), using cats as subjects, designed his research to 
compare the effects on learning of convulsive and subconvulsive stimulation 
and of two different intervals between treatment, 24 and 48 hr. Decrements 
in learning were clearly observable following convulsions, but stimulation 
leading to “‘misfits’’ produced no measurable effects. Recovery was faster 
following subconvulsive than following convulsive stimulation. The results 
were interpreted as indicating greater effects on learning and slower recovery 
with the shorter than with the longer intervals between treatments. Brown 
& de la Garza (41) observed that rats given ESC both during and before the 
learning period begins are unable to inhibit anticipatory responses when 
performing in a maze. The preshock group also were more erratic in their 
performance. 

The points of greatest departure from previous research on the effects of 
ESC on learning and retention have been either the use of new species as 
subjects or of new problems to be learned and retained. Hamilton & Patton 
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(42) studied the difference in rates of spontaneous recovery between (a) rats 
receiving ESC immediately after establishment and extinction of a condi- 
tioned escape response and (b) control animals undergoing no experimental 
treatment during a period of time equivalent to that required for the ESC 
treatment. The results indicate that ESC favors spontaneous recovery, but 
that this effect is only transitory. Brown & de la Garza (41) have investi- 
gated the effect of ESC on the anticipatory responses of rats during the 
learning of a multiple-T maze under conditions of dim illumination and hom- 
ogeneous surroundings. Controls learned the maze during a period in which 
they received one trial per day for 15 days, while the convulsed animals did 
not reach the criterion of learning. The main difference between the groups 
appeared to lie in the failure of the convulsed animals to inhibit anticipatory 
responses. Braun and his colleagues (43, 44) reported experiments on rhesus 
monkeys, designed to study the effects of ESC on the learning and retention 
of problems varying in difficulty. Twenty convulsions, induced at a rate of 
three per week, resulted in an impairment in the retention of an object- 
quality discrimination learning set, established prior to the shock series, 
but an impairment which was small in magnitude and very temporary in 
duration (43). The differences between the convulsed and control animals 
were most apparent on the differential cue trials when there appeared a 
tendency of the convulsed animals to adopt a response pattern of perseverat- 
ing on the side which previously had been correct. This latter observation 
is of particular interest when compared with studies of reactions to stress 
generally in which rigidity in adhering to previously learned responses de- 
spite their inadequacies is most marked. A second experiment (44) dealt 
with the effects of a series of 20 ESC treatments on the performance of the 
monkey in a task of greater complexity, the oddity problem as used by 
Harlow and his collaborators (45). No significant differences in learning per- 
formance were found between the convulsed and control animals, both 
groups showing the progressive interproblem improvement characteristic of 
the formation of a learning set. Braun has interpreted these results as show- 
ing that, with the monkey, effects of ESC are not related to problem dif- 
ficulty as they appear to be with the rat. Thus, the comparison of different 
species with regard to these particular aspects of behavior has led to the 
discovery of significant phylogenetic differences, the type of observation for 
which the comparative method is admirably suited. However, since training 
on the problems began 94 days after the last ESC and extended over four 
additional months, the possibility exists that temporary effects related to 
difficulty of problem may have dissipated prior to the learning. 

Bendig & Patton (46) obtained positive results in a study of the effects 
of ESC on latent learning. Rats fed ad libitum with food and water in their 
home cages were given a series of trials in a one choice T maze with escape 
from water as the form of motivation. Food pellets were present on the same 
one of the two landing platforms during these trials. Following ESC treat- 
ment of the experimental animals, both controls and experimentals were 
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given a series of test trials while operating under 23 hr. food deprivation. 
The results support the conclusion that ESC significantly decreased the 
effects of ‘‘latent learning.” 

Two studies at the University of Chicago laboratory have added to our 
knowledge concerning the effects of ESC on a “‘conditioned emotional re- 
sponse.’ By reinforcing a light stimulus with electric shocks to the feet, 
Brady & Hunt (47) rapidly produced in their rats a response of crouching 
and defecating to the presentation of the light alone. After the response was 
established experimental animals were given a series of ESC and the controls 
an equal number of pseudo-ESC. The convulsions had the immediate 
effect of almost eliminating the condition response. Hunt, Jernberg and 
Brady (48) showed that similar effects also appear in terms of a marked 
reduction of lever pressing in the Skinner box. In both experiments the ef- 
fects were only temporary, the conditioned responses reappearing during the 
first 30-odd days following the treatment. That a ‘‘vestigial form’’ of the 
conditioned response survived even these temporary effects was suggested 
by the fact that experimental extinction following treatment interfered 
with the reappearance of the response. 

In his studies on the effects of ESC upon learning in monkeys Braun 
(49) has included a study of delayed responses. Such responses, presumably 
involving symbolic behavior and memory ft action, showed no long-term 
impairment following ESC. The design of the experiment did not allow 
examination of the possibility that temporary losses might appear. 

During the past few years several studies have been reported in which 
the effects of ESC on the behavior of animals treated while under the in- 
fluence of various drugs have been examined. Recently Thomas & Stone 
(50) have convulsed albino rats after treatment with dilantin sodium, an 
anticonvulsant. No significant differences were observed in the subsequent 
maze performance of these animals and that of a shock-control group, al- 
though the ESC was associated with performance significantly worse than 
that of a drug-control group. Hunt, Jernberg & Lawlor (51) have studied 
the effect of ESC, administered under ether anaesthesia, on a conditioned 
emotional response. Three groups of rats were used, the first receiving ESC 
only, the second being convulsed while under the anaesthesia, and the third 
receiving ESC followed by anaesthesia. The results confirmed previous re- 
ports by other investigators that ESC does not have its usual effects on be- 
havior of the rat if the convulsions are prevented from occurring by anaes- 
thesia at the time of treatment. 


STUDIES OF CONFLICT AND BEHAVIOR DISORDERS 


Pavlov (52) once wrote: ‘‘... the decision, or the conditions favorable 
to a decision, of many important questions of etiology, the natural systema- 
tization, the mechanism and finally the treatment of neuroses in the human 
being lies in the hands of the animal experimenter.’’ Contemporary investi- 
gators have taken this charge very seriously as evidenced by the relatively 
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large number of recent studies of conflict and abnormal behavior patterns 
employing animals as subjects. The current trend is to view these studies in 
the framework of learning rather than frustration theory. 

Some time ago Miller (53) described three basic varieties of response com- 
petition or conflict. Recently he and his collaborators, using rats as subjects, 
have reported a series of three experiments designed to test the extension of 
his original analysis to certain details of the phenomena of ‘‘displacement.”’ 
Miller & Kraeling (54) studied the generalization of conflict to new stimulus 
situations. An approach-avoidance conflict was established by training 
hungry animals to run from one end to the other of a short alley in order to 
reach foot and then giving them electric shocks until they failed to touch the 
food on three successive trials. Miller’s analysis would predict that such 
conflict would be revealed in vacillations between approach and avoidance 
responses at some point in the alley. The generalization of this conflict was 
studied by testing some of the animals in the same alley in which they were 
trained, others in a somewhat different alley, and still others in a very dif- 
ferent alley. Appropriate control groups were included in the research design. 
Animals tested in all the alleys showed the vacillation predicted, indicating 
generalization of the conflict. By observing the proportion of animals touch- 
ing the food in the different alleys during test trials it was concluded that 
approach generalized more strongly than avoidance. In a second experiment 
Murray & Miller (55) compared generalization gradients of approach and 
avoidance when the age of the habits and the number and distribution of 
training trials were controlled. Four groups of animals were used, two rein- 
forced by food and two by electric shock. One of each of these latter groups 
was given continuous and the other partial reinforcement. The strength of 
pull on release after being restrained halfway to the goal served as the 
measure of the strength of approach and avoidance responses. Four test 
trials were run, two in the same alley in which the animal had been trained 
and two in a different alley. The steepness of the gradients of generalization 
was represented in each case by the difference between the strength of the 
pull in the two alleys and proved to be greater for avoidance than for 
approach. This difference in slope was highly significant when the measures 
of the test trials were averaged, although it did not appear in the first test 
trial of the group receiving continuous reinforcement. Another experiment 
by Miller & Murray (56) was designed to test the prediction that the gradient 
of stimulus generalization should be steeper for responses motivated by a 
learned, response-produced drive dependent upon external cues than for a 
primary drive independent of external cues. Avoidance responses were estab- 
lished to electric shock in an alley containing distinctive cues. Half the 
animals were then tested with the shock turned off so that they were 
motivated only by the secondary drive of ‘‘fear,’’ which was assumed to vary 
with the cues in the alley. The other half were tested with the shock on and 
hence motivated by a primary drive, which could not be assumed to vary 
with alley cues. Half of each of these two groups were tested in the same 
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alley in which they had been trained and the other half in a different alley. 
Strength of response was measured in terms of the force exerted by the 
animals against a temporary restraint. The gradient of stimulus generaliza- 
tion of the secondary-drive animals was steeper than that of the primary- 
drive animals and confirmed the prediction under test. The authors suggested 
that alternative explanations of the results are improbable, but cannot be 
entirely eliminated. 

Other studies have been concerned with different manifestations of be- 
havior under stress. Ullman (57) has followed up some of his earlier research 
on the production of a ‘‘compulsive eating symptom”’ in rats. He had sug- 
gested that four drives were operating in the situation to which his animals 
were exposed: hunger, hunger-fear, pain, and pain-fear. An observed in- 
crease in eating during electric shock was presumed to occur because eating 
had become a generalized tension-reducing response in the situation, a point 
of view supported by the fact that the animals ate when shock was applied 
despite having been previously satiated. In his follow-up study Ullman has 
set out to discover the relative importance of each of three variables inti- 
mately concerned in producing the symptom: level of hunger, amount of 
food-taking training, and intensity of electric shock. In a factorial research 
design each of these variables was present at two different levels, and the 
results indicated that the combination of conditions most favorable to the 
development of ‘‘compulsive eating’’ was: the lesser training in eating and 
the higher levels of hunger and shock. 

In another experiment Datel & Seward (58) reported on observations of 
the persistence of an ear-scratching response in the rat. Collodion was 
applied to the ears of the experimental animals on five successive days, 
producing intense ear-scratching behavior. Control animals received no such 
application and showed no such heightened level of scratching activity. The 
groups had been equated on frequency of scratching during a four-day ob- 
servation period. Ear-scratching was sampled during 10 min. observation 
periods once each day. Analysis of the results showed that for at least 16 
days after cessation of treatment mean scratching scores for the experimental 
animals were significantly higher than those for the controls. The possibility 
that these results might be explained by residual after-effects of the collodion 
on the tissues was ruled out on two grounds: first, five or six days after 
applications of collodion ceased evidence of skin irritation and sensitization 
had disappeared; and, second, the difference between the groups did not 
diminish with time. Several possible explanations of the results were dis- 
cussed, the favored approach being via Hull’s concepts of secondary motiva- 
tion and secondary reinforcement. 

Bailey & Miller (59) have studied simple approach-avoidance conflict by 
observing the differential effects of a depressant drug, sodium amytal, on 
the relative strengths of the competing response tendencies. Hungry cats 
were subjected to conflict involving approach to food and avoidance of 
electric shock. When the animals remained away from the food on three 
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successive test trials they were divided into two groups, the experimentals 
being given an intraperitoneal injection of sodium amytal and the controls 
an injection of normal saline. During test trials run at 5, 10, 15, and 20 
min. after injection the two groups showed significantly different reactions 
to the conflict situation, all the experimental animals but only one control 
approaching the food within four trials. These are the results predicted on 
the grounds that sodium amytal should have a greater effect in reducing 
the fear motivating avoidance than the hunger motivating approach. Of 
interest are the authors’ remarks that completely negative results were 
obtained in exploratory studies using rats as subjects. This is another sug- 
gestion that important differences may await research designed with phyletic 
comparisons in view. 

The extensive series of studies by Maier and his colleagues (25) on the 
characteristics of behavior induced by ‘“‘frustration’”’ in the insoluble prob- 
lem situation have been concerned almost entirely with responses made at 
single choice points. At the 1953 Annual Meeting of the British Psychological 
Society, Watson (60) reported a study of stereotyped behavior ina multiple- 
choice insoluble problem situation. Rats were presented with an insoluble 
problem involving four choices, thus providing a large number of possible 
seriation response patterns. Stereotypes, many extremely persistent, de- 
veloped, involving sequences of responses. The animals received 200 trials 
under these conditions. After this phase of the research was completed, a 
soluble problem was introduced, which all the animals learned. Analysis of 
the results suggested that behavior on both the insoluble and soluble prob- 
lems was consistent with that which would be predicted from conventional 
learning theory. 

During the period of the present review the question as to whether 
reactions to stress fit into the general framework of learning theory or if 
another type of schema is required has received considerable attention. The 
controversial center of a number of contributions has been Maier’s theory 
based upon qualitative distinctions between ‘‘goal-motivated”’ and ‘‘frustra- 
tion-instigated’’ behavior. The majority of contributors have been rather 
unhappy with Maier’s formulations, although at least one, Austin (61), has 
concluded that Maier’s research program in the case of frustration has 
achieved some success. Maier’s present position was stated in a 1951 paper 
(62) in which he argued that certain features of the experimental results 
obtained by other workers are not satisfactorily covered by current anxiety- 
reduction theory and that certain of the findings in his own laboratory could 
not be explained in terms of learning theory. The rebuttal of these arguments 
has come in the form of reinterpretations of the controversial results and of 
experiments designed to test various aspects of Maier’s theory. 

Wolpe (63, 64) in critiques of the literature on ‘‘experimental neuroses,” 
has rejected Maier’s theory on the grounds that it is based upon a misinterpre- 
tation of the ‘‘abnormal behavior fixations” appearing under conditions of 
the insoluble problem. Wolpe has pointed out that Maier found it necessary 
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to compel his animals to jump by use of air blast or electric shock. Each 
time an animal was compelled to jump the response ended the blast or shock 
and, consequently, led to a reduction of the drive that forced the response. 
The response was not unrewarded as Maier has assumed. Wolpe has also 
suggested that the punishment, e.g., nose-bumping, associated with the stere- 
otyped response is ineffective as a response-eliminating agent in relation to 
the strength of reinforcement associated with escape from the air blast or 
shock. It is not possible here to examine the details of Wolpe’s analysis of 
“abnormal fixation’’ experiments, but the example given above illustrates 
the general approach which has led him to favor explanations based upon 
learning rather than frustration theory. 

Although the manner in which Maier’s theory has been formulated makes 
critical tests of it difficult to design, two studies have been reported which 
were planned to examine certain deductions from it. The results of both of 
these studies support interpretations based on learning theory. Knépfel- 
macher (65) has taken as his point of departure Maier’s assumption that the 
rigidity of stereotyped responses is a function of frustration. Continued pun- 
ishment during the trials of Maier’s customary research procedure should 
not only fail to decrease rigidity but actually to increase it. If this continued 
frustration were eliminated, for instance, by rewarding the stereotyped re- 
sponses once they were established, the stereotype should become more 
amenable to extinction or, at least, not less amenable. On the other hand, if 
principles of learning theory are operating, reinforcement of the stereotyped 
response should tend to strengthen it. Rats were used as subjects and ex- 
posed to an insoluble problem in a discrimination unit employing escape 
from water as motivation. Each animal forming a stereotype was assigned 
to one of four groups. The first group served as a control and the other three 
were given 60, 120, and 180 rewarded trials, respectively. Finally, all groups 
were tested on a soluble problem in the same apparatus. Analysis of the re- 
sults showed that none of the stereotypes was sufficiently rigid to meet 
Maier’s criterion of ‘‘fixated’’ responses. The strengths of the stereotypes 
were directly proportional to the number of rewarded trials, which was inter- 
preted as evidence that they were not qualitatively different from learned 
responses. 

Wilcoxon (66) has presented additional evidence that conventional 
S—R learning theory can account for the phenomena reported by Maier. 
Two experiments were carried out. The first was designed to isolate the 
effects of partial and nondifferential reinforcement in the general situation 
employed by Maier. Three groups of rats were trained in an initial habit 
under different experimental training conditions: continuous reinforcement 
involving invariable reward for correct and punishment for incorrect re- 
sponses; partial reinforcement where the correct response was rewarded half 
the time and punished the other half, while incorrect responses were always 
punished; and the conditions of the insoluble problem situation. Finally, all 
animals were tested on a new and soluble problem requiring abandonment 
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of the initial habit. Analysis of the results showed that animals of the in- 
soluble problem group took longer to establish a consistent response than 
did animals of the other groups. This was interpreted as indicating that non- 
differential reinforcement per se increases variability, an interpretation ‘‘con- 
trary to Maier’s point of view.’’ The results also supported the conclusion 
that partial reinforcement of the first habit is of primary importance in the 
development of fixations. It was observed that responses established under 
conditions of partial reinforcement were more rigid in the final tests than 
those established under conditions of the insoluble problem. Wilcoxon’s 
second experiment was designed to examine techniques other than manual 
guidance for altering fixations once they were established. It provided re- 
sults which indicated that such alteration was possible by applying relatively 
simple techniques based on learning principles. 

Although most of the studies of reactions to stress have been oriented 
around problems of psychological theory, their potential value in supple- 
menting clinical research in psychiatry and ‘‘psychosomatic’’ medicine has 
been recognized. Beach (67) has suggested that ‘‘... if ‘psychosomatic’ 
relationships in the human being are defined objectively and scientifically 
it becomes apparent that comparable phenomena occur in infrahuman 
animals.”’ Russell (68) has pointed to certain problems in psychiatry which 
can only be attacked experimentally by the use of infrahuman subjects. 


CONCLUDING REMARKS 


Areas within comparative psychology have been selected for inclusion 
in the present review primarily in terms of the relative frequency with which 
they received attention from investigators during the period covered. In 
one case, experimental psychogenetics, research reports have been few, but 
promise great. 

No radically new principles or techniques have been suggested in the 
publications of the comparative ethologists, whose general approach re- 
ceived so much attention in the past two Reviews. It would seem that we are 
now in a stage where this approach is undergoing careful and detailed scru- 
tiny, as evidenced, for instance, by Ginsberg’s (69) lengthy ‘‘reconstructive 
analysis” of the concept of instinct. During the past year research on animal 
learning has remained a ‘‘current trend” as it has for so many years in the 
past, but the new results are more suitably reviewed in the chapter on learn- 
ing. Several investigations (70 to 73) have examined specific instances of the 
reciprocal relations between chemical balance within the organism and the 
behavior of the organism. There have been very few aspects of behavior 
which have not been examined as they occur in infrahuman animal species, 
but very few of these examinations have followed the design which the 
comparative sciences as a whole might have expected of a truly comparative 
psychology. 
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PHYSIOLOGICAL PSYCHOLOGY? 


By R. A. Patton 
Department of Psychology, University of Pittsburgh, Pittsburgh, Pennsylvania 


The physiological psychologist does not deal with unique problems but 
seeks a common ground for a science of behavior with related disciplines such 
as biochemistry, neurophysiology, and other biological sciences. During the 
period covered by this review progress has been evident in the continued de- 
velopment of precise methods for evaluating behavioral changes as a function 
of specific central nervous system damage. The psychologist and the bio- 
chemist have worked more closely together on common problems. A number 
of new approaches to clinical behavioral problems have continued their de- 
velopment. No final answers to these problems have been presented. The 
studies reviewed below, however, give evidence of a satisfactory progress 
toward the formulation of the important questions. 


EARLY ENVIRONMENTAL EFFECTS ON BEHAVIOR 


During recent years there has been an increasing interest in the important 
problem of the influence which early environmental conditions may exert on 
later behavior. One experimental approach has been the study of effects of 
early experiences upon later behavior where the early experience was ex- 
tremely varied, as compared with a restricted environment. Hymovitch (1) 
blinded rats either in early life or at maturity and then reared them in a rela- 
tively free environment or in an environment which allowed only a restricted 
amount of activity. At maturity the animals were tested in a closed field 
apparatus designed by Hebb and Williams and on an enclosed 10-unit 
T-maze. It was found that the differences between early- and late-blinded 
animals were not significant. The differences in the performances of the free 
and restricted groups on the T-maze were likewise not significant. The free 
groups were found to be superior to the restricted groups when their per- 
formance was measured in a closed field test. In two additional groups free 
environmental experience was received during early life and restricted ex- 
perience later on, while in another group this order of experience was reversed. 
In the tests conducted at maturity, the former group was found to be supe- 
rior to the group receiving early restricted environment. In a comparable ex- 
periment, Forgays & Forgays (2) secured similar results which they inter- 
preted in terms of the differences in opportunity for perceptual experience 
and learning in the various environmental situations. Bingham & Griffiths 
(3) maintained rats in similarly divergent environments for a 30-day period 
which began when the animals were 21 days of age. Tests were conducted two 


1 This review covers the period from June, 1952 to June, 1953. 
2 The following abbreviations are used in this chapter: ACTH (adrenocortico- 
tropin) ; EEG (electroencephalogram). 
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weeks later in five different situations: an inclined plane maze, a jumping ap- 
paratus, an open field temperament test, a test for sound-induced seizures, 
and the Warner-Warden maze. Animals reared in the free environment were 
clearly superior to those reared in the restricted environment in maze learn- 
ing performance while no significant differences appeared in the other tests. 

A second approach to this problem consisted in the study of the effects of 
early experience of adverse environmental conditions such as x-irradiation 
or anoxia. In a study referred to later, Levinson (4) observed the relationship 
between different amounts of x-irradiation given at five stages of fetal life 
and the postnatal learning ability of rats. At 50 days of age the performance 
of the rats was found to decrease as a function of the amount of fetal irradia- 
tion applied. Greater variability in the performance of the irradiated animals 
was also reported. 

Becker & Donnell (5) studied the effect of asphyxiation on guinea pigs by 
occluding the maternal uterine vessels after one animal in each litter had been 
delivered to serve as a control. One group of animals was released in time to 
recover spontaneously. A second group was asphyxiated until comatose and 
then artificially revived. Problem boxes were utilized 8 to 10 weeks later to 
evaluate learning performance. Problems of two levels of difficulty were 
utilized and greater error and perseveration scores were found for the experi- 
mental animals. Two weeks later retests showed that retention was affected 
by asphyxiation in a similar manner. Hurder & Sanders (6) used one-year- 
old rats exposed to different conditions in a decompression chamber. Two 
experimental groups were exposed to a simulated altitude of 30,000 ft. for 
3 hr., and 34,000 ft. for 38 min. When 100 days of age the groups were trained 
on Stone’s multiple-T alley-maze. Learning scores for the experimental ani- 
mals were not significantly different from those of the controls. 

Griffiths & Stringer (7) studied the effects of intense stimulation during 
infancy on adult behavior of the rat. The following four types of stimuli were 
used: rapid rotation, extremes of environmental temperature, auditory stim- 
ulation, and electric shock. Each rat in the experimental groups received a 
3 min. period of stimulation daily during the first nine days of life and twice 
daily until the fifty-seventh day. At 60 days of age subjects were tested in 
four different situations: the Lashley-discrimination apparatus, the open 
field test for emotionality, a test for sound induced seizures, and the 
Warner-Warden maze. No significant differences appeared to distinguish the 
performances of the experimental groups from those of untreated control 
animals. 


SENSORY PROCESSES 


Although research on sensory processes is reviewed in the chapters on 
Vision and Audition, several studies will be covered here since they are rele- 
vant to the orientation of the present chapter. 

Montgomery & Heinemann (8) have attempted to apply some of von 
Frisch’s findings on the function of polarized light as a basis for navigation in 
bees, to homing behavior in birds. These authors reasoned that birds should 
be able to discriminate patterns of light polarized in one plane from polarized 
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light patterns presented in another plane. Homing pigeons were used as sub- 
jects and the stimuli were presented on ground glass stimulus keys. The ani- 
mals were rewarded only for responses to polarized light in one plane and 
maintained at 80 per cent of normal body weight during acquisition and 
extinction trials. This study gave no evidence that pigeons could discriminate 
between planes of polarized light, although they did discriminate brightnesses 
with ease. 

Three well-designed studies were reported which dealt with color sensi- 
tivity in Drosophila melanogaster. Fingerman (9) and Fingerman & Brown 
(10, 11) have presented evidence indicating that this organism may possess 
true color vision. Observations were made on the strength of phototaxis in 
response to white and monochromatic light. Responses to colored light were 
lost with a reduction in the intensity of the stimulus, and the authors sug- 
gested the existence of photopic and scotopic mechanisms similar to those 
which have been described in the vertebrate eye. With different groups of 
organisms, spectral response curves were plotted when 17 wave lengths were 
presented and phototactic orientation responses noted. The well-established 
genetics of Drosophila permitted the careful selection of six groups of animals 
differing only in eye color. There was the suggestion that the spectral re- 
sponse curves differed not only in strength but also in form and might have 
been dependent upon a differential absorption of light by differing eye pig- 
ments. That the duplicity theory may possibly apply to the compound eye is 
supported by anatomical findings of both long and short axon retinal cells 
in different Arthropoda. The last study in this series investigated a possible 
Purkinje shift in the vision of Drosophila. Spectral response curves were ob- 
served to change as the intensity of light was decreased. The above men- 
tioned investigations outlined both physiological and comparative data 
which relate the compound eye to the visual structures found in higher ani- 
mal species. 

Alexander & Githler (12) in a study related to military problems, studied 
the effects of jet engine noise on the function of the cochlea. Guinea pigs 
were exposed for a 15 min. period to jet engine noise which had an ambient 
sound level in excess of 140 db. over a measured frequency range from 25 to 
7500 c.p.s. Six to eight weeks later auditory sensitivity was measured by ob- 
serving cochlear potentials for 10 frequencies between 200 and 10,000 c.p.s. 
The group loss in sensitivity was found to be 34 db. Histologic examination 
of the ear structures revealed the existence of cochlear damage between 6 
and 12 mm. from the basal end, supporting the contention that the entire 
cochlea is involved in responses to sound. Electrical techniques showed de- 
pressed cochlear activity which correlated with the evidence of histological 
damage. This investigation has a practical value in its applications to mili- 
tary problems and also presents additional information about cochlear func- 
tion and methods for investigating it. Dice & Barto (13) studied the sensi- 
tivity of mice (Peromyscus) to ultrasonic stimulation. One sound source 
employed varied from 500 to 16,000 c.p.s. and the other from 10 to 100 kc.p.s. 
Sound pressures of 1, 5, 10, or 20 dynes per centimeter at the animal’s ear 
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were used. The appearance of conditioned responses was interpreted to 
indicate sensitivity over a range from 500 to 95,000 c.p.s. in animals from 
25 to 70 days of age. Auditory sensitivity for these animals was found to de- 
crease with age. 

Bevan & Hunt (14) investigated the effects of sensory (proprioceptive) 
inflow on convulsive seizures produced in rats by maintaining them on a 
synthetic diet deficient in magnesium. Proprioceptive damage was induced 
by sectioning the dorsal funiculus of the cord at the cervical level. No differ- 
ences were observed in either the frequency or the severity of the convulsive 
reactions which appeared in all deficient groups. As noted by a number of 
previous investigators, auditory stimulation was proved to be an effective 
method of inducing seizures in magnesium deficient animals. 


THE EFFECTS OF X-IRRADIATION 


Furchtgott (15) has studied the influence of x-irradiation on the acquisi- 
tion of a brightness discrimination habit under conditions of low illumina- 
tion. Rats were irradiated over the whole body and their performance was 
later compared with that of controls. Irradiated animals were inferior to the 
controls, this difference being most apparent immediately after treatment. 
It was suggested that this evidence indicated that damage to peripheral re- 
ceptors rather than central nervous damage was responsible for the observed 
effects. Levinson (4) observed the effects of irradiation during the fetal period 
upon postnatal learning in rats. Pregnant rats were exposed at five different 
stages of gestation to different specific dosages. Learning tests were subse- 
quently carried out in the Lashley III maze when the young were 50 days 
old. With increasing amounts of irradiation significant impairment and 
variability of performance were found. It is of interest that the greatest ef- 
fects on learning seemed to be apparent when treatment was given on the 
13th day of gestation. In accordance with the findings of previous investiga- 
tions, the suggestion was made that during the development of neural struc- 
tures basic to maze learning a critical time might exist when they are par- 
ticularly susceptible to damage. Another study in this research area was 
carried out by Arnold (16) who exposed the heads of rats to amounts of x- 
irradiation which were usually lethal when administered over the entire 
body. These animals had learned a Tolman-Honzik T-maze before treatment 
and were tested 1, 12, and 25 days after irradiation for retention. No signifi- 
cant impairment of performance resulted under the conditions described. 

Two other studies have appeared in which the authors attempted to study 
behavior other than maze performance in relation to x-irradiation. Kimel- 
dorf & Newsom (17) investigated external environmental factors influencing 
the survival of rats following irradiation. Such animals died when subjected 
to prolonged exposure to cold temperatures (6° C.). This temperature was 
well tolerated by normal animals. It was found that a similar dose of irra- 
diation was not lethal for control animals at a normal room temperature of 
23° C. Nims & Sutton (18) exposed rats to whole body x-irradiation and 
measured weight changes and water consumption. The over-all metabolism 
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of fasted rats was not changed appreciably and the degree of hydration of 
tissues and weight loss seemed independent of the animal’s initial weight. In 
nonfasted, irradiated rats, the weight loss persisted for a period of time which 
seemed proportional to the dosage received. Animals given sublethal dosages 
grew at a rate which was parallel to that shown by the control animals. 


EXPERIMENTAL BRAIN DAMAGE AND BEHAVIOR 


The study of relationships between functional areas of the brain and 
behavior has been of recurrent interest in the field of pi:)siological psy- 
chology. The following studies are representative of the direction in which 
this type of research has been proceeding during the past year. 

Rosner (19) trained groups of rats on two different tasks, a retracing, 
Lashley III maze and a nonretracing multiple-T maze. With lesions of less 
than 15 per cent of the neocortex the postoperative performance of the ani- 
mals was: not impaired. When lesions of over 20 per cent were made there 
was the tendency toward refusal to run the maze and aversion to operating 
the retracing doors. Mesocortex and allocortex lesions, involving phylo- 
genetically older parts of the pallium, were associated with significantly 
poorer postoperative performance than was removal of neocortex only. 
Similar amounts of neocortex were destroyed in both groups. This study 
suggested the great importance of exploring the differential effects on be- 
havior of cautery as against aspiration methods. There is some evidence that 
the effects of cautery may extend beyond the bounds of the lesion as defined 
by standard anatomical techniques (20). Forgays (21) showed that delayed 
aphasic, motor, sensory, and intellectual defects followed cerebral opera- 
tions performed on human patients for the relief of epileptic seizures. Such 
delayed symptoms usually appeared about two days after operation and 
were followed by complete recovery in about a month without special train- 
ing. All major subdivisions of the cortex were represented in operations with 
a group of 50 patients. Forgays (22) also reported that delayed behavioral 
disturbances followed frontal plus parietal lesions, and also bilateral parietal 
lesions. After the latter type of lesion the behavioral effects were less pro- 
nounced. If the lesions were made bilaterally in the frontal and occipital 
areas or in the anterior part of the cerebellum no significant behavioral im- 
pairment was found. In these studies the ‘‘closed field’ test of rat ‘‘intelli- 
gence’ devised by Hebb and Williams was used. It seems significant that 
the brain lesions described above produced behavioral disturbances which 
appeared at some time after cerebral injury. Recovery from these disturb- 
ances occurred after a few weeks without the necessity of retraining. 

Pickett (23) trained blinded rats to run elevated mazes with narrow and 
wide pathways and also an alley maze, the animals being forced to rely 
mainly on tactual and kinesthetic cues. The mazes were rotated randomly 
in order to eliminate extramaze cues. Three groups of animals were used: 
the first had lesions of the motor and somatic cortical areas; the second, 
lesions of the posterior cortex excluding somatic areas; and the third, sham- 
operations. The results indicated that while the loss of motor areas handi- 
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capped performances on elevated areas, the cortical loss in the posterior 
areas did not do so. The results with the alley maze indicated that anterior 
lesions were not significantly more damaging than posterior lesions although 
a high correlation existed between loss of tissue and running ability. Two 
studies were reported by Chow (24, 25) on visual discrimination and condi- 
tioned reactions in the monkey. The experimental animals were trained and 
tested before and after the operations. In one group the prestriate areas, 
including lateral, medial, and frontal areas, were removed. Temporal neo- 
cortex was removed in another animal and the frontal eye fields were ablated 
in two other monkeys. The animals with bilateral temporal neocortical 
lesions were tested 12 days after operation but failed to score at better-than- 
chance levels. However, when the animals were trained on new discrimina- 
tion problems after this test they showed quick recovery of the lost habits 
when tested again at the end of three months. The animals with extensive 
prestriate lesions failed to distinguish between food and inedible objects, and 
in addition were hyperactive and gave evidence of disorientation. In addi- 
tion to showing the above symptoms an additional animal with the loss of 
temporal neocortex was retarded in relearning conditioned responses. No 
significant symptoms were associated with lesions in the frontal eye fields. 

Zubek (26) reported the effects of extirpating somatic sensory areas I 
and II on roughness discrimination and tactual form discrimination in the 
cat and the rat. Cats showed amnesia for the testing situation when area 
II was removed either in combination with area I or alone. There was a loss 
of ability to learn or relearn fine discriminations. In the case of rats, it 
was found that ablation of either or both of these areas had no effect on tac- 
tual discrimination. While these studies with two animal species are not in 
agreement it is possible that the successful discriminations reported in the rat 
study were based on sensitivity other than tactual. 

Evarts (27) studied the ability of the monkey to perform auditory dis- 
criminations after massive lesions involving up to 90 per cent of the auditory 
cortex. The author observed neither loss of ability to learn frequency dis- 
criminations nor loss of previously learned discriminations. The experiments 
required discrimination between tones of 350 and 3500 c.p.s. Meyer & Wool- 
sey (28) investigated the effects of extirpations in various combination of 
auditory areas I and II, somatic area I, and the posterior ectosylvian gyrus 
on frequency and intensity discriminations. These lesions resulted in com- 
plete and lasting deficits for frequency discrimination but not for intensity 
discrimination. The extirpations of either auditory areas I and II alone, 
however, seemed to have little effect on either type of discrimination. In 
two other studies with monkeys, Evarts (29, 30) removed large amounts of 
auditory cortex and tested the animals on auditory-visual association prob- 
lems. While these animals performed significantly better than chance they 
performed significantly less well than they did before operation. In other 
groups of animals the cortico-cortical connections (prestriate) were disrupted 
and auditory-visual associations were observed. Extensive lesions of the 
prestriate areas resulted in no loss in the complex problems utilized. Behav- 
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ioral impairment in learning new tasks was not observed. Along with other 
evidence these results suggested that there are pathways other than the pre- 
striate which mediate associations between visual and auditory inputs. It is 
possible that thalamic pathways are involved. 

During the period covered by this review several investigators tested the 
capacity of experimental animals to solve problems involving anticipatory 
behavior or temporal discrimination. Utilizing a Hull-Hill (31) type maze, 
Carpenter observed the effects of removing parietal and frontal cortical 
areas on anticipatory behavior in the rat. Damage to the parietal region re- 
sulted in an increase in errors, whereas damage in the frontal area was asso- 
ciated with more anticipatory errors than were observed in control animals. 
The results of this study support the suggestion of previous authors that the 
increase in anticipatory errors may be understood in terms of a release from 
frontal inhibition. On the other hand, the impaired behavior of animals with 
parietal lesions might have resulted from an increase in perseveration asso- 
ciated with a release from parietal inhibition. 

Warren & Harlow (32, 33) reported two studies of learning of pattern 
and object discriminations by monkeys after prolonged postoperative re- 
covery from extensive bilateral lesions. The results showed that on pattern 
discrimination tests the frontal injury group did not differ significantly from 
the normal controls. Both groups of animals were definitely superior to the 
posterior group. Operated animals did not differ significantly from the nor- 
mal controls on object discrimination tests. In the second investigation 
these authors observed the effects of additional contralateral lesions on dis- 
crimination behavior. Monkeys with extensive unilateral lesions and de- 
struction of posterior association areas made significantly more errors than 
normal monkeys or animals with extensive unilateral damage and destruction 
of the contralateral frontal association areas. Harlow (34) also reported that 
although extensive unilateral cortical lesions left the operated group inferior 
on discrimination problems, they reached a high level of response after long 
training. 

Leary et al. (35) continued the study of monkeys with anterior and pos- 
terior cortical lesions on double-alternation problems at the Wisconsin lab- 
oratory. Animals with frontal lesions demonstrated a strong tendency toward 
stereotypy while large numbers of single alternations were performed by ani- 
mals with the posterior lesions. The Wisconsin group (36) presented addi- 
tional evidence of a similar nature in monkeys tested on temporal pattern 
and discrimination problems. Brain lesions similar to those of the previous 
investigation were studied. It was found that the posterior lesion and the 
normal groups did equally well on delayed response problems. Both excelled 
the performance of animals with frontal injuries. With discrimination prob- 
lems, however, the frontal lesion and normal animals excelled the perform- 
ance of those with posterior damage. These results were interpreted as re- 
futing the hypothesis of an equipotential associative cortex. There was evi- 
dence of ‘‘differential and complementary frontal and posterior syndromes,” 
revealed by an inefficient performance on all discrimination learning tasks 
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used and relatively efficient performance on tests of delayed response. Op- 
posite results were found for animals with the anterior syndrome. The com- 
bined data in these and related studies from the Wisconsin laboratory has 
indicated that (a) differential localization of large bilateral cortical lesions 
produces differential syndromes of intellectual loss; (b) no function (such as 
discrimination learning) is totally destroyed even by large bilateral lesions; 
and (c) quantitative losses in learned performances associated with large cor- 
tical lesions are greatly reduced after prolonged testing. It would seem that 
many studies overlooked a possible source of important information about 
brain function if postoperative tests are conducted immediately without ad- 
ditional long-term observations being made. 

Wade (37) compared the effects of prefrontal lobectomy and lobotomy 
on the behavior of the monkey. “Intellectual function’ was found not to be 
impaired in several tests, but there was a decrement in the performance 
scores on the tests of delayed response. Section of the associative fibers seemed 
to produce instability in some conditioned reactions, but did not impair 
behavior on delayed response tests. 

In a significant study Pribram et al. (38) made ablations in the dorsolat- 
eral frontal cortex in one group and the ventrolateral cortex in another 
group of guinea baboons. These operations were bilaterally symmetrical. 
Preoperative training on visual discrimination and delayed response was 
given. Postoperative tests yielded greater decrements on these tasks in the 
case of the dorsolateral animals than the ventrolateral group. These results 
suggested that pronounced behavioral effects associated with the ablation 
of frontal cortex in previous research must relate importantly to the loss of 
the dorsolateral region. 

Brody & Rosvold (39) have contributed to methodology in their study 
of the relationships of prefrontal lobotomy to the social behavior of monkeys. 
A rigid dominance hierarchy was established in a group of animals which 
was characterized by aggressive attacks directed toward members lower 
down in the order. The stability of the established hierarchy was lost after 
prefrontal lobotomy. The observed changes in a small social group appar- 
ently resulted from a decrement in learned avoidance responses in the op- 
erated, low status monkeys. The observed changes in avoidance and aggres- 
sive behavior were apparent in the upwardly directed aggression and upward 
mobility in the hierarchy. 

French, Amerongen & Magoun (40) presented evidence that there exists 
in the monkey, as in the cat, a centrally located region comprising the mid- 
brain tegmentum, the subthalamus and hypothalamus, and the medial 
portion of the thalamus in which evoked potentials can be recorded from 
peripheral stimuli applied to the somatic, visual, auditory, and visceral 
systems. Evidence is discussed which indicates that this region functions 
as a unit and is subject to excitation by peripheral sensory stimulation. This 
excitation affects electrical activation of the cortex and behavioral reactions, 
such as awareness of stimulation, arousal to wakefulness, or attention in the 
animal. Lindsley et al. (41) had previously presented a report dealing with 
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behavioral and EEG? changes resulting from chronic lesions in this region. 

In a second investigation, French & Magoun (42) extended the study of 
chronic lesions in this important area. Electrolytic lesions were made in the 
central cephalic portion of the brain stem of nine monkeys. Information 
on the behavioral effects of destruction of this area was desired. Although 
complete destruction resulted in death, smaller lesions produced akinesis 
and hypersomnolence. Electroencephalograms were characterized by hyper- 
synchrony not responsive to sensory arousal stimuli. 

Anand & Brobeck (43) showed that bilateral electrolytic lesions in the 
amygdaloid nuclei of rats produced the following: (a) no change in food in- 
take; (b) a decrease in spontaneous activity; (c) a transient fall in body tem- 
perature; (d) no development of savageness. Similar effects were observed 
in cats. In the latter animals food intake remained normal in those whose 
lesions did not invade the lateral hypothalamus. Beattie (44) has described 
a new instrument for the accurate placement of small brain lesions in the rat. 
The lesions are produced electrically with good control over various param- 
eters of stimulation. The electrode is mounted on a _ pantograph-type 
carrier, readily removable to allow access to the field of operation. 


BIOCHEMICAL CORRELATES OF BEHAVIOR 


Several studies designed to investigate biochemical factors that may aid 
in differentiating between the behavior disorders have appeared recently. 
In a well designed experiment by Easterday and co-workers (45), groups 
of 10 subjects were given a controlled amount of exercise on a bicycle ergom- 
eter. In addition to a normal group, manic depressive, psychoneurotic, 
and schizophrenic patients were utilized. Blood glutathione, lactic acid, and 
pyruvic acid concentrations were determined from blood samples taken at 
2, 5, 10, 20, 40, and 60 minutes after a standardized period of exercise. It was 
found that: (a) lactic and pyruvic acid values were significantly greater in 
females than in males; (0) no significant relationships appeared with respect 
to glutathione; (c) pyruvic acid values were significantly lower for normals 
than for the groups of patients; and (d) lactic acid values were significantly 
lower for normals than for the schizophrenic group. No inverse relationship 
between reduced glutathione and lactic acid was found for schizophrenics. 
These interactions were tested by analysis of variance and covariance, with 
appropriate control checks for variations in work done as related to the 
weight and height of subjects. The authors reported that significant increases 
in lactic acid production under exercise in schizophrenic patients, as com- 
pared with normals, suggested that glycolysis per se or the reduction of glu- 
cose to lactic acid is an efficient mechanism in schizophrenic patients. The 
apparent need for greater lactic acid production, however, suggested a dis- 
turbance in the mechanism for the biochemical transformation of chemical 
into mechanical energy rather than in the production of lactic acid itself. 
Further studies might well be oriented toward problems of phosphorus me- 
tabolism in schizophrenic patients. 

The presence of a hyperglycemic factor in the urine of so-called schizo- 
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phrenic patients has been reported by Meduna & Vaichulis (46). Meduna & 
McCulloch (47, 48) have reclassified these schizophrenics as ‘‘oneirophren- 
ics’ and have suggested a correlation between a hyperglycemic factor in the 
urine and an abnormal carbohydrate metabolism. A recent study was re- 
cently reported by Morgan & Pilgrim (49) which made a significant contribu- 
tion to knowledge about this problem. These investigators confirmed the 
presence of a hyperglycemic factor in the urine of some schizophrenics. 
Methods for concentrating the factor were described and data presented for 
the yield and potency of the various fractions isolated. The factor has been 
tentatively identified as a protein or a substance closely bound to a protein. 
By a comparable fractionation of normal male urine, a measurable quantity 
of the hyperglycemic factor was not found. This study represents a step for- 
ward in the clarification of complex biochemical states underlying one of the 
most prevalent types of mental disorder. 

Jacobs & Tempereau (50) published on the relationship of a proteolytic 
enzyme in schizophrenic patients. The subjects were 96 male schizophrenics 
who had not received previous psychiatric treatment. Thirty-six of these 
patients showed a high concentration of the enzyme (rennin-inhibitor). These 
patients also exhibited the following behavioral response patterns: (a) a low 
tolerance to frustration in the clinical situation and (b) the appearance of con- 
fusion, amnesia, or stupor in response to stress ordinarily ineffective in pa- 
tients of this type. These authors felt that the clinical group having a high 
concentration of the enzyme most closely fits the traditional classification of 
catatonia. 


ADRENO-CorRTICAL MECHANISMS 


During the past year a great many studies have appeared in the general 
scientific literature concerning adreno-cortical hormones. For the most part, 
however, these studies dealt with chemical and strictly physiological vari- 
ables which are not the primary concern of this review. A few, however, are 
pertinent and will be covered. 

Rosvold et al. (51), in a well controlled study, showed that when daily 
electroshock convulsions were administered to a group of 40 rats for a period 
of 10 days there was an accompanying increase in the size of the adrenals. 
The same type of experiment was repeated, using rats anesthetized by ether 
and pentobarbital sodium [nembutal; sodium 5-ethyl-5-(1-methylbutyl) 
barbiturate]. Electroshock was given to this and to a control group which 
was not anesthetized. No convulsive seizures occurred in the anesthetized 
group. Upon examination 10 days later the convulsed, control group showed 
enlarged adrenals while the anesthetized group did not show an increase. 

Del Pozo (52) analyzed the possible effects of cortisone on the contraction 
of normal skeletal muscles. The experiment made use of cats with Achilles 
tendons cut and tied to a myograph. Biphasic electric shocks with a Grass 
stimulator were applied to the centrally cut sciatic nerve through shielded 
electrodes. Maximal shocks were used and several frequencies tested. Intra- 
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arterial injections were made through the inferior mesenteric artery periph- 
erally ligated and with retrograde cannulation. Intravenous injections were 
made through the jugular vein. It was found that doses of soluble cortisone 
of 6 mg./kg. produced an increase in amplitude of muscular response to the 
corresponding nerve. The enhancement of muscular contraction was also 
shown in the improvement of tension maintenance. Del Pozo pointed out 
that it was previously thought necessary to have an adrenal-cortex insuffi- 
ciency to detect the effect of cortical hormones on muscular contractions. 
The results of this experiment show that this is not always the case. 

Torda & Wolf reported three studies on the effects of ACTH? (53, 54, 55). 
It was shown that prolonged administration of ACTH increased the ability 
of the brain to synthesize acetylcholine (53). This increase amounted to 60 
per cent in both hypophysectomized and nonoperated dogs. Using adrenal- 
ectomized rats a 20 per cent increase in synthesis of acetylcholine was ob- 
served. In another study (54) it was shown that in hypophysectomized dogs 
there was a decrease in the capacity to maintain action-potential amplitude 
in muscle fibers. ACTH restored the loss of this capacity, which remained 
unaltered during repeated stimulation. Torda & Wolf (55) also reported that 
ACTH increased the electrical activity of the brain; the EEG’s showed in- 
creased voltage, occasional spiking, paroxysmal runs of low frequency, high 
voltage waves, and lowering of the convulsive threshold to pentomythelene 
tetrazol. The ACTH showed similar effects in hypophysectomized, adrenal- 
ectomized, nonoperated, and sodium injected animals. It was also shown 
that when the ACTH was administered in large quantities over long periods 
of time a decrease in cortical activity resulted. 

Berkeley (56) tested the hypothesis that subjects who showed the widest 
divergence between goal and performance in a level-of-aspiration setting 
would also show the greatest increase in adrenocortical activity. Thirty-one 
males were used as subjects (only three of them being college students). 
These performed a manual level-of-aspiration task using an apparatus that 
could be manipulated by the experimenter to produce either success or failure 
by a desired amount. The score used was the difference between score ob- 
tained and score expected. Stressfulness was evaluated by the production 
rate of urinary 17-ketosteroids during the test period as against pretest con- 
trols. Discrepancy between aspiration and achievement with respect to both 
success and failure conditions was related to increase in output of 17-keto- 
steroid hormone. A multiple R of +.70 was obtained which is significant be- 
yond the 1 per cent level of confidence. 

Carr (57) tested the hypothesis that adrenalectomizing a rat lowers the 
taste threshold for sodium chloride when threshold is measured in behavioral 
terms and is uninfluenced by the rat’s natural preference for salt. The rats 
were conditioned to drink salt water in preference to distilled water by shock- 
ing them if they chose distilled water and by varying the position of the vials 
in a random manner. After the conditioned response had been established 
the concentration of salt was decreased until frequency of its choice was not 
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significantly greater than that of plain water. The preoperative rate of intake 
of a 3 per cent salt water solution was determined. Adrenalectomy was then 
performed and the postrecovery intake of 3 per cent salt water was again 
measured for the surviving animals. Postrecovery rate was significantly 
different from preoperative rate. The avoidance conditioning paradigm was 
again instituted, and it was shown that rather than a decrease of salt being 
necessary for discrimination the amount actually increased, although the 
increase was not significant. Carr concluded that the ‘‘need” for sodium 
chloride did not influence the ability of rats to detect it. 

Siegel (58) injected rats intraperitoneally with blood serum drawn from 
hungry donors. These injected animals ate no more food over the subsequent 
two-hour interval than similar rats injected with blood serum from satiated 
donors. The findings were interpreted as having negative implications for a 
“hormone”’ theory of hunger. The volume of blood injected was inversely 
related to subsequent food intake of the animals. 


PHYSIOLOGICAL APPROACHES TO CLINICAL PROBLEMS 


Wishner (59) undertook a study of the possible relationships between 
physiological and Rorschach measures of behavior characteristics. The data 
sought were in the area of the relationship between overt and physiological 
activity. Eleven psychoneurotic patients with anxiety as a major compo- 
nent, and 10 normals took the Rorschach test. On the same day, measure- 
ments of the physiological activity of the subjects were made under various 
stimulus conditions. These measures included electromyograms from the 
frontal and masseter muscles, electroencephalograms, heart rate, respiration 
rate, and the galvanic skin response. None of the Rorschach scoring cate- 
gories, considered singly, differentiated between the two groups. Heart rate 
was significantly higher for the neurotic group, and respiration rate also 
tended in this direction. The normals had higher galvanic skin responses and 
muscle action potentials from the frontal muscles. 

This past year was noted for additional research (see 60, 61, 62) on the 
relations between drive (anxiety) and learning, and these studies are consid- 
ered by other reviewers. Although at present some of this research is not 
specifically oriented toward either clinical or physiological psychology, it is 
probable that in the near future both of these fields will feel the influence of 
this approach. 

Malmo, Davis & Barza (63) reported an experimental study of a 19- 
year-old girl with apparently complete hysterical deafness. Electromyo- 
graphic reactions to novel and intense auditory stimulation, and auditory 
conditioning were recorded. Initial auditory stimulation produced a startle 
reaction and emotional disturbance. Repetition of the stimulus one minute 
later produced no reaction and no change in an electromyogram. Auditory 
conditioning was obtained, the conditioned responses being confined to mus- 
cle potentials. The therapeutic usefulness of conditioning procedures in 
cases of hysterical deafness was suggested. 





XUM 








viIw 


PHYSIOLOGICAL PSYCHOLOGY 259 


Klein & Krech (64) reasoned that if prolonged exposure to stimulation 
results in decreased conductivity of cortical tissue, this effect will be accen- 
tuated in cases of severe brain lesions. They compared brain-injured patients 
and controls on the degree and rate of satiation after exposure toa kinesthet- 
ic stimulus. The subjects, while blindfolded, rubbed the sides of a wooden 
standard and after varying periods of time judged the width of a control 
object by adjusting a tapering scale. Persistence of the effect was measured 
at stated intervals. It was found that the frequency and intensity of satia- 
tion effects were greater in the brain-injured. Also, in such patients satiation 
effects were reached more quickly and persisted longer than in normal con- 
trols. These findings suggested that a generalized effect of severe brain-injury 
is lowered conductivity. 

Lazarus, Yousem & Arenberg (65) carefully studied the relationships be- 
tween hunger and perception. Two experiments were described in which 
hunger was varied and thresholds of recognition determined tachistoscopi- 
cally for food and nonfood objects in 110 subjects. Differential thresholds of 
recognition for the food versus nonfood items varied significantly with the 
degree of food deprivation and subjective hunger. Fifty-one additional sub- 
jects made guesses from a list of 16 names of objects, 10 of which were used 
in the tachistoscopic presentation. Half the list contained food items and 
half nonfood items. In this multiple-choice form of the experiment no rela- 
tionship was found between hunger and perceptual recognition. The authors 
concluded that with precisely defined stimuli under unlimited exposure, 
tachistoscopic recognition thresholds are a function of the hunger need. This 
relationship, however, did not occur when the response variation of the sub- 
jects was limited by a multiple choice situation. 

Semmes (66), in a stimulating review on experimental approaches to 
agnosia, suggested that in man, because of his capacities, injury, disease, 
or both, often produce disorders such as agnosia, apraxia, and aphasia which 
offer an extraordinary and largely unexploited field for research. The nature 
of behavioral impairment in primates following lesions of the parieto- 
temporo-preoccipital neocortex and of the “‘associative’’ cortex were dis- 
cussed. She suggested two possibilities as to the nature of these deficits as 
they may relate to similar disorders in man: (a) there may be a loss of the 
capacity to utilize normal sense-data, as a basis for learned discriminations; 
or (b) the sense-data themselves are altered in an as yet unexplained way. A 
program of research was described in which these and other questions related 
to aphasia can be tested more feasibly with a series of brain-injured, human 
subjects. Suggestions were given for research on how somatic sensation and 
related abilities are altered by central and peripheral nervous lesions. 

The New York group of investigators has maintained a continued inter- 
est in the systematic study of individuals who have suffered brain injuries 
(67 to 70). An earlier review by Bender & Teuber (71) critically summarized 
current ideas on the relationship of brain damage in humans to symptoms 
such as agnosia and alexia. A recent study by Battersby, Teuber & Bender 
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(72) presents data on problem solving behavior in men with frontal or oc- 
cipital brain injuries. 

Eighty World War II veterans were tested on a modification of the Maier 
String Problem. These subjects had sustained three types of injury: pene- 
trating frontal, parieto-occipital, and peripheral nerve wounds (controls). 
Results with this test showed that brain-injury in frontal or parieto-occipital 
areas is less disruptive of problem solving than was indicated in previous 
reports. Some subjects took longer and were dependent on hints in achieving 
solution, but the group differences were too small to reach significance. 

In normal subjects ability on the reasoning problems employed corre- 
lates more highly with ability to perceive hidden contours (Gottschaldt 
figures) than with ability to sort to a criterion (Weigl test). Neither of these 
correlations was significant in the brain-injured groups. Patients with ante- 
rior lesions did no worse than those with posterior injuries. These findings are 
not easily compatible with many current assumptions that the integrity of 
the frontal lobes is of singular importance for the solution of problems similar 
to those employed in these studies. 

Marshall, Walker & Livingston (73) investigated the parameters of 
stimulation in a case of photogenic epilepsy. By use of intensity attenuation 
of a stroboscopic light with Polaroid filters, and also by noting the light re- 
quired to give a positive EEG reaction of increased voltage with slowing, it 
was demonstrated that red light was at least 10 times as effective in trigger- 
ing convulsive attacks as other types of colored light. Glasses eliminating 
this light component were of considerable aid to the patient. 

Several reports have described new approaches or methods useful in 
physiological approaches to behavior, among them being Flanders (74) who 
has described a circuit for the continuous measurement of palmar resistance. 
The requirements that should be met if ‘‘resistance level”’ is to be useful as a 
measure of palmar skin resistance are outlined. Also, suggestions for meeting 
these requirements, based on equipment developed and used over a period of 
several years, were proposed. 

In a related study on methodology, Grings (75) compared three different 
manifestations of electrical skin activity in human subjects. Data were pre- 
sented on skin impedance as a function of the frequency of measurement cur- 
rent, resistance versus reactance components of skin impedance, and the 
relation between skin resistance and the magnitude of the measuring cur- 
rent. Relationships between AC and DC resistance components were shown 
to be linear, and the relation between skin potential and impedance quanti- 
ties was nonlinear. 

Other apparatus papers of interest are Baker & Taylor's (76) description 
of methods for recording changes in skin temperature. Barger & Gebhard 
(77) have developed an electronic stimulator for use on the human eye. This 
apparatus is of interest because of recent investigations in which the thresh- 
old of electrically induced sensations of light is used as an indicator of the 
effect of photic stimulation on the activity of visual mechanisms. 
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ABNORMALITIES OF BEHAVIOR! 


By JuLeEs H. MAssERMAN, ZOLTAN Gross, AND CurRTIS PECHTEL 


Department of Nervous and Mental Diseases, Northwestern University, Chicago, Illinois 
and the Downey Veterans Administration Hospital, Downey, Illinois 


As in previous years, surveys of specialized literature in this field will be 
found in the Annual Survey of Psychoanalysis (42), the American Journal of 
Psychiatry, (3), Progress in Neurology and Psychiatry (135), and the Year 
Book of Neurology and Psychiatry (79). In the present chapter we shall at- 
tempt to integrate those experimental and clinical studies possessing general 
psychologic interest. 


EXPERIMENTAL INVESTIGATIONS 


Fixation and avoidance responses in animals.—Wilcoxon (150) found in a 
study of rats trained on the Lashley jumping apparatus under various condi- 
tions that partial reinforcement played a primary role in the development of 
stereotyped reactions, that such responses need not be considered abnormal, 
that they could easily be eliminated by training and that the data accorded 
so well with Hullian stimulus response theory as to render unnecessary 
Maier’s concepts of ‘‘fixation habits.” Bailey & Miller (7) taught 5 kittens 
and 12 cats to lift covers from food rewards and then shocked the animals to 
superimpose avoidance reactions. The latter were not affected by control 
injections of saline, but were mitigated by the administration of sodium 
amytal. The authors believe that amytal has a specific depressant effect on 
fear responses rather than acting like other hypnotics in disorganizing re- 
cently learned, complex and therefore relatively vulnerable patterns of be- 
havior. 

Physiologic correlates—Beach (8) demonstrated in a comprehensive re- 
view of the field that psychosomatic responses can be elicited and studied 
advantageously in all species of animals. Mahl (82) in a special study of the 
physiologic concomitants of acute and chronic fear, measured gastric acid- 
ity and blood sugar levels in two monkeys at rest, in acute fear, and in chronic 
fear. Mahl’s results were not in accord with Cannon’s emergency theory, 
because in his data HCI secretion did not increase in response to acute stress 
but remained abnormally high during chronic fear, whereas the blood sugar 
level varied independently from the HCI. 

Experimental neuroses—Masserman & Pechtel (87) produced severe 
and persistent aberrations of behavior in monkeys by subjecting them to re- 
peated conflicts between learned feeding patterns and apparently innate 
aversions to the appearance of a toy rubber snake. The experimental neu- 
roses thus induced were characterized by inhibitions of feeding, generalized 


1 The survey of the literature to which this review pertains was completed in April, 
1953. 
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phobias, gastrointestinal and cardiorespiratory dysfunctions, motor stereo- 
typies or catalepsies, markedly increased homo- and autoeroticism, “‘hallu- 
cinatory” behavior (i.e., the animals would appear to seek and eat only 
imaginary pellets while avoiding real food), and marked changes of social 
conduct in relation to other animals or to the experimenter. The specific con- 
figurations of these patterns and the methods effective in individual and 
group therapy tended to vary with the species of monkey rather than with 
the nature or intensity of the trauma. 

Wolpe (161) reported his observations on 12 cats subjected to grid 
shocks with or without learned feeding responses, and in general confirmed 
the results of similar experiments by previous workers. Nevertheless, he 
objects to Masserman’s concepts of motivational conflict and Maier’s thesis 
of “abnormal behavior fixations,’’ and proposes instead that experimental 
neuroses are induced (a) by ‘‘ambivalent stimulation” arising from difficult 
discriminations, delay before reinforcement, or rapid reversals, and (b) 
“‘noxious stimuli’ which interrupt learned responses. Wolpe postulates fur- 
ther that the specific patterns of the neurosis are learned and persist because 
the drives producing them are reduced by some degree of temporal and spa- 
tial escape from the anxiety-engendering stimulus. It may be noted, how- 
ever, that these formulations eventuate in about the same procedures both 
for inducing and treating the neurotic deviations. 

Masserman (86), in a review of the experimental data and basic concepts 
of biodynamics in relation to current tendencies to reintegrate biology, psy- 
chology, and medicine, concludes as follows: 


This, then, is a summary . . . of a long series of experiments designed to analyze the 
biodynamics of behavior and to discern principles that would apply alike to ‘‘normal’’ 
and “abnormal” conduct, to animal and human subjects, and to experimental and 
clinical therapy. The gap between the responses of cats, dogs or monkeys in cages and 
the conduct of man in society is undeniably wide; certainly man, of all creatures, has 
developed the greatest facility in experiential association and integration, the highest 
capacity for symbolic, verbal and other imagery, and the most elaborate repertoire of 
“normal,” ‘‘neurotic’”’ and “psychotic”? behavior patterns in a constantly changing 
social and cultural milieu. And yet, as elsewhere in medicine, the best way to unravel 
an especially complex problem is to take it into the laboratory as well as the clinic, 
investigate it by specially designed experiments, check their results with a rigid self- 
discipline that eliminates subtle errors and cherished preconceptions, and so advance 
bit by bit toward clearer formulations of general principles and more pertinent applica- 
tions of them. In psychiatry, such experimental and operational approaches, when cor- 
related with clinical practice, may not only dissolve the verbal barriers among the 
various schools and methods but may also foster a needed rapprochement between 
psychiatry on one hand and scientific medicine and the humanities on the other. 


THE NEUROSES 


Nosology — During 1952 the fourth revision of the Diagnostic and Statisti- 
cal Manual of Mental Disorders (109) was prepared by the Committee on 
Nomenclature and Statistics, American Psychiatric Association, and pub- 
lished in official form. A number of important changes in the etiologic and 
\ 
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diagnostic concepts of various behavior disorders embodied in this manual 
may be reviewed as follows. 

The new classification of mental disorders is proposed because (a) ‘‘only 
about 10 per cent of the total cases seen in the Armed Services fell into any 
of the categories ordinarily seen in public mental hospitals” (p. vi), and (0d) 
two interim revisions (of the last official edition) by the Army and by the 
Veterans Administration had not proved completely satisfactory. The cur- 
rent manual explicitly (p. 9) 


recognizes the present day descriptive nature of all psychiatric diagnoses, and at- 
tempts to make possible the gathering of data for future clarification of ideas con- 
cerning etiology, pathology, prognosis and treatment in mental disorders . . . (and) for 
inclusion of new ideas and advances... without radical revision of the system of 
nomenclature. 


In general, the manual fulfills these purposes, although there is little ap- 
parent logic of organization or codification in some of the proposed revisions. 
For example, on p. 92 the heading ‘‘¢+328 Chronic brain syndrome NOS” is 
inexplicably followed by a coequal heading ‘321.1 Passive dependency 
(tt includes passive-aggressive personality),’’ but nowhere is there any ex- 
planation of the system of coding, nor could a cryptographer of only ordin- 
ary skill divine the principles employed. Again, many specific sounding diag- 
nostic distinctions without operational differences are retained: thus, ‘‘Psy- 
choneurotic disorders (310-318, ¢t328) with or without somatizations, are 
listed separately from disorders of “Character, behavior and intelligence 
(320-326, t*327, tt328)”’ with no explanation as to how one could be neurotic 
without ‘‘a disorder of behavior’’ as the only index of disturbances either in 
‘intelligence’ or “character.” Moreover, some of the proposed definitions 
lean too heavily on psychoanalytic or psychosomatic tenets that are as yet 
far from being sufficiently clear or valid. Thus, on p. 29, it is asserted that 
“‘psychophysiologic autonomic and visceral disorders... represent the 
visceral expression of affect which may be thereby largely prevented from 
being conscious,’’ whereas the criteria used to differentiate organ neuroses 
from conversion, (itself a dubious dynamic concept) are that organ neuroses 
(a) involve the autonomic rather than the voluntary nervous system, (5) do 
not relieve anxiety, (c) are ‘physiologically rather than symbolically” de- 
termined, and (d) result in the “frequent production of structural changes 
which may threaten life.” Somewhat less dogmatic but still controversial is 
the new taxonomy of the psychoneuroses, which are subclassified according 
to the manner in which “anxiety is . . . expressed . . . or controlled by vari- 
ous psychological defense mechanisms’: e.g., dissociation, conversion, 
phobias, or depression. Under the ‘‘Personality Trait Disturbance” (another 
original category) various time-worn ambiguities (e.g., ‘‘Emotionally un- 
stable personality’’) are also preserved; in this rubric, too, the new term 
“sociopath” (which replaces the epithetic psychopath) now includes the sub- 
types of antisocial, dyssocial, sex deviate, and drug addict. Supposedly 
cyclic disorders of behavior formerly termed manic depressive psychoses are 
more appropriately termed affective reactions with ‘manic, depressed, and 
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other manifestations. Even more boldly, the categorical diagnosis of schizo- 
phrenia has been discarded in favor of schizophrenic reactions, with acute 
undifferentiated, chronic undifferentiated, schizo-affective, childhood, and 
residual forms in addition to the usual hebephrenic, catatonic, and paranoid 
subtypes. Neurasthenia and hypochondriasis have, at long last, been elim- 
inated altogether as disease entities, a loss to patients and physicians who 
long used them as convenient euphemisms, but a boon to more conscientious 
nosologists. The two most novel proposals in the new nomenclature, how- 
ever, are these: first, to make the diagnosis of ‘‘mental deficiency”’ subsidiary 
to “chronic brain syndromes” with a primarily organic causation, and sec- 
ond, the addition of one of four phrases: ‘‘x1 with psychotic reaction,” ‘‘x2 
with neurotic reaction,” ‘‘x3 with behavioral reaction,’ and ‘x4 with mental 
deficiency” to any diagnoses which need to be so modified. Finally, the 
manual incorporates the Army and Veterans Administration practice of add- 
ing specific notations as to the following: (a) external precipitating stress, 
(b) premorbid personality and predisposition, and (c) the degree of psychia- 
tric impairment, i.e., minimal, mild, moderate, or severe. 
Each of these changes will, no doubt, be approved and decried for differ- 
f ent reasons by almost equal numbers of specialists in the field. Those trained 
in traditional nosology may be horrified by the progressive departures from 
the Kraepelinian system; others more dynamically inclined may point out 
that the manual still retains terms like psychoneurotic, cyclothymic, or for 
that matter neurotic and psychotic, the meanings of which, obscure enough 
| originally, have become even more confused by decades of deviant interpre- 
| tations. Nevertheless, many sections of the manual evince clear thinking, 
hard work, and reasonable compromise among conflicting tenets in the field; 
for this, Dr. George N. Raines, Chairman of the Committee, and his col- 
leagues have earned the gratitude of all who for research and clinical pur- 
poses must attempt to classify the protean aberrations of human behavior. 
| Unfortunately, the nosologic confusion that the manual attempts to cor- 
{rect is still evident in much of the literature. Thus, Liber (74) attacks the 
term ‘‘neurosis” on the grounds that it encourages “‘loose thinking”’ in psy- 
chiatry and is too often used to cover errors in medical diagnosis. Similarly, 
Bowman & Rose (13) deplore the use of the phrase ‘‘sexual psychopath,”’ where- 
as objections are also raised by Ascher (5) against the diagnosis of ‘“‘neurotic 
depression’ and by Weiss (147) against ‘‘neurocirculatory asthenia.’’ Con- 
versely, Cohen et al. (23) accept ‘‘neurocirculatory asthenia’ and attempt to 
ascertain its ‘familial prevalence’? whereas Wilson (153) describes still an- 
other ‘‘neurosis of everyday living’’ characterized by an ‘‘unconscious exag- 
geration of normal social patterns’’ such as race prejudice. Friedman (41) 
believes that neurotic patients may be distinguished from psychotic in that 
the actual or the screen memories of childhood in the neurotic group show 
neither a loss of personal identity nor a marked fear of such loss. 
Interviewing.—In a comprehensive manual Menninger (90) gives detailed 
directions for various techniques of examining patients and their families, 
for recording case histories, and for planning therapy. Masserman (85) 
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offers a briefer set of suggestions to aid the general practitioner in the essen- 
tial psychosomatic supplementation of his methods of history taking, phy- 
sical examination, and differential diagnosis, and Whitehorn (149) discusses 
anew the diagnostic and therapeutic psychodynamics of various interviewing 
techniques. In this connection Cameron (18) and Karnosh & Quinn (65) 
point out that of all the neuroses encountered in private practice anxiety 
states are the most frequent, the most easily diagnosed, and the most amen- 
able to skillfully directed therapy. 


RESEARCH STUDIES IN DIAGNOSIS 


Physiologic—Persky et al. (101) studied the excretions of hippuric acid 
after the administration of sodium benzoate to three groups of adults: nor- 
mal, those exhibiting free anxiety, and those diagnosed as catatonic schizo- 
phrenics. When the normals were exposed to transient stresses of moderate 
severity, their hippuric output did not vary markedly; in contrast, the sub- 
jects with anxiety showed an increased average excretion correlated directly 
with the intensity of the affect, whereas the catatonics showed the lowest 
excretory rate with a tendency toward normal levels as the patients’ be- 
havioral status improved. Hemphill et al. (52) report that adult female pa- 
tients in anxiety states feel fatigue more easily and have a lower perceptual 
sensitivity to pain than do patients with endogenous depressions. Seitz & 
Shipley (127) believe that there are demonstrable relationships between 
thoughts expressed by a patient and specific changes in his psychogalvanic 
skin responses as recorded continuously during a clinical interview; however, 
to one acquainted with the technical pitfalls of PGR determinations the 
correlations may appear to be questionable. 

Psychologic—A number of investigators have been interested in deter- 
mining the effects of anxiety on various physiological and psychological 
processes. In some studies anxiety was presumably induced by the threat of 
mild shock or the necessity of scoring adequately on a test; in other instances 
persons who have in some way been diagnosed as anxious were compared 
with a normal population. Thus, Taylor & Spence (141) after choosing popu- 
lations differentiated as extremes on the scale of Taylor's special test of 
anxiety and comparing the two groups in serial learning performance, found 
that anxious subjects were significantly inferior to the controls in learning 
competing responses (see also the chapter on Personality). 


PsyCHOSOMATIC DYSFUNCTIONS 


Owen (139), in a recent symposium points out that localized somatic 
lesions often have broad ‘‘holosomatic”’ effects, and that physicians who 
treat only the local lesion will fail if they neglect its total setting. Dewan 
(139) states that mild depressions may simulate peripheral disease and that 
symptoms arising from anxiety about a particular organ must be differen- 
tiated from symptoms attributable to structural alterations in that organ. 
Williamson (139) notes in this connection that most complaints about heart- 
trouble in general practice are a result of cardiac neuroses, whereas Mac- 
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Donald (139) contends that illness and dependency would diminish among 
older people if our society did not encourage their early retirement. Prick 
& Calon (106) derive psychsomatic illness from an underlying neurosis, and 
add that infantile effects and impulses determine the organic area to be in- 
volved. Pearlman (99) adds that such illnesses may be used to maintain a 
comfortable equilibrium in which the patient gratifies covert dependent and 
narcissistic needs which would otherwise not be consciously acceptable. As 
to specific psychosomatic reactions, Abramson (1) acknowledges the allergic 
basis of bronchial asthma but insists that its intensity and persistence is 
maintained in an appreciable number of patients by unconscious conflicts. 
Fowler & Zeckel (37) report that common neurotic patterns and precipitating 
factors were found in a group of 23 patients with Méniére’s disease; con- 
versely Tissenbaum et al. (144) warn that Parkinson’s syndrome, multiple 
sclerosis, and expanding brain lesions have often been initially misdiagnosed 
as psychiatric disorders. 


ETIOLOGY OF THE NEUROSES 


Childhood experiences.—Spitz (136) reiterates the thesis that when con- 
genital disease and other known physical factors are eliminated, the mother- 
child relationship is a major factor in the etiology of psychogenic diseases in 
‘infancy. Wolff & Bayer (160) believe that parents overly concerned with the 
infant’s bodily functions may encourage later somatizations of neurotic 
symptoms. Johnson & Szurek (61) state that parents may unwittingly and 
also for vicarious gratification seduce the child to act out their own poorly 
integrated or forbidden impulses, thus establishing in the child superego 
defects which duplicate those of the parents. Wulff (162) and Sandford (123) 
likewise comment on the similarity of childhood neurotic symptoms and 
adult neuroses and attempt to explain the resemblance in conformity with 
standard psychoanalytic theory. Jackson et al. (59) note that children well 
adjusted to a favorable family environment show less emotional trauma when 
hospitalized. Johnson & Szurek (61) differentiate between neurotically hos- 
tile aggression in children and biologic ‘‘aggressiveness,’’ a term which these 
and other writers seem to confuse with initiative. Sager (121), on the other 
hand, contends that aggressive behavior is always a reaction to injury or 
frustration and is not itself an instinct. 


PREVENTION OF NEUROSES 


Scheidlinger (124), Joseph et al. (64), and Luton (77) again point out the 
need for early recognition and control of neurotic processes through the prac- 
tice of mental hygiene in schools, special clinics, and community programs. 
Bettelheim (12) and Sontag et al. (134) repeat the familiar theme that pre- 
ventive psychiatry must deal with the factors that impair every child’s 
ability to master his inner conflicts and the tasks imposed by society. In 
more general reviews, Coleman et al. (24) describe the advantages of the 
team-work approach in treating the child and its parents simultaneously, 
and Bender (9) outlines the various techniques, ranging from investigations 
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of form perception to the development of group relationships which she has 
utilized with thousands of problem children at Bellevue Hospital. Arthur 
(4) asserts that only psychoanalysis can make conscious the repressed 
determinants of behavior disorders in children; however, without defining 
his concept of psychoanalysis, Arthur admits that nonanalytic therapy of 
both the parents and the child can be effective in alleviating undesirable 
patterns of conduct. 

Hulse (56) reverts to the notion, long ago modified even in Freudian 
theory, that sexual frustration may be the primary cause of tension and 
anxiety, and therefore believes that the general practitioner should advise 
his neurotic patients to permit themselves greater freedom in conjugal 
relationships, apparently as a measure of mental hygiene. 

Psychopathy and social deviation—Sturup (138) points out that crimi- 
nality is only an expression of society’s rejection of the patient’s form of 
conduct, as determined by his unique constitutional, psychophysical, cul- 
tural, and social background. The solution of the problem of deviations of 
behavior must therefore lie in a rapprochement of the two norms, though this 
is admittedly a difficult and often impossible condition. Nevertheless, Weber 
(146) believes that if the psychopath could be reached by intensive psycho- 
therapy before late adolescence, sufficient advantage could be taken of his 
capacity for genuine affection and his tendency to identify with strong lead- 
ers to make the prognosis for social readjustment relatively favorable. 


THERAPY OF THE NEUROSES 


Jackman & Schorr (58), basing their conclusions on 25 cases, report that 
carbon dioxide inhalations are effective in decreasing anxiety, introspection, 
and conflict, and compare the method to a ‘‘chemical frontal lobotomy.” 
However, Freeman (39) points out that in both ether and carbon dioxide 
therapy, unconscious dynamic forces are involved and must be assessed in 
the results. 

Drug therapy—Miller (92) employed 120 to 220 mg. of amobarbital 
sodium (amytal sodium; sodium 5-ethyl-5-isoamylbarbituric acid) in 20 to 
25 cc. of 10 to 15 per cent ethyl alcohol intravenously for ambulatory nar- 
coanalysis and reports that alcohol appears to counteract the unwanted 
side effects of the barbiturate and to facilitate verbal and affective release. 
Masserman & Pechtel (88) could demonstrate no beneficial effects of mephe- 
nesin (3-otoloxy-1, 2 propanediol) on neurotic animals; nevertheless Paul (98) 
reports relief of anxiety and tension in 26 of 30 neurotic patients given this 
drug and postulates that it produces muscle relaxation without hypnotic 
effects by acting on spinal internuncial neurons. Hill et al. (53) conclude that 
morphine reduces the disruptive effects on performance produced by the 
anticipation of pain. 

Hormonal.—Brody (17) reports that cortisone and ACTH therapy may 
exacerbate adverse emotional responses ordinarily repressed in the absence 
of adrenal overstimulation. Lidz et al. (75) believe that the euphoria some- 
times observed during ACTH therapy is in large part attributable to the 
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relief of physical distress and may, as Hollender (54) warns, lead to addic- 
tion. Conversely Braceland (14) notes that anxiety and depression may 
supervene if the patient had unconsciously regarded his previous invalidism 
as a neurotic advantage. 

Psychotherapy.—The literature in this field for 1952 fell mainly under 
the rubric of technical discussions of psychoanalytic technique, as reviewed 
by Saslow in this volume. Alexander & Ross (2) once again question the 
spurious dichotomies between so-called (a) psychoanalytic or deep as opposed 
to (b) superficial or brief techniques, and advocate instead more valid dis- 
tinctions among (i) stereotyped and inefficient as contrasted with (ii) more 
skillful and effective methods of therapy. 

Rennecker & Cutler (113), extending psychosomatic postulates to one 
extreme, recommend that in the therapy of cancer of the breast in women 
the patient’s libido be mobilized to combat the illness. In practice, this is 
facilitated by complete candor leavened with emphasis on prognostically 
favorable factors and sympathetic aid in resolving her sexual, familial, and 
social anxieties. Relevant to such utilization of transference is Brewster’s 
(15) reminder that dependent patients under intensive psychotherapy may 
suffer disruptive ‘‘separation reactions” during interruptions of their 
analysis. 

Hypnotherapy.—Schenck (125) treated a patient with zoophobia by con- 
trolling anxiety through posthypnotic amnesias, and Paley (96) reports five 
cases of chronic alcoholism in which short term hypnosis encouraged ‘‘psy- 
chic surrender”’ and facilitated insight. Rosen (120) advocates hypnotherapy 
as a means of inducing emotional abreactions in patients who would other- 
wise be repressed and resistant; however, Rosen, apparently in accord with 
Bernheim’s dictum “‘it is a wise hypnotist who knows who is hypnotizing 
whom,” warns against the danger of participating too freely in the patient’s 
fantasies. Finally, Reid & Finesinger (112) discuss insight in psychotherapy 
and discard the usual dichotomy of intellectual versus emotional under- 
standing. Instead, they distinguish the following forms: (a) generic, i.e., 
‘any cognitive act by which we grasp the significance of some pattern of 
relations,’ (b) neutral, which denotes intellectual understanding without 
affective discharge, and (c) dynamic, which alone is capable of inducing 
lasting therapeutic effects by objective and operational standards. It is not 
at all necessary that an interpretation be true to be effective; it need only 
be believed. However, in the authors’ words, in some patients even this would 
only be adding “insight to injury.” 


ABNORMAL BEHAVIOR IN MILITARY SETTINGS 


Selection and performance——According to Peterson & Chambers (102) 
only the helplessly dependent or aggressively immature need be excluded 
from service, since other neurotic characteristics are often compatible with 
military efficiency. Potter (104) studied 174 men separated from the Navy 
for unsuitability and concluded that although a large number of them had 
enlisted to escape an intolerable situation or were looking for excitement and 
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adventure, there were no constant patterns in such individuals that would 
preclude their induction. Brill & Beebe (16) likewise point out that many 
men who seemed well adjusted in civilian life break down in combat, whereas 
others known to be neurotic functioned well; therefore, only those who 
clearly demonstrate an inability to adjust after a trial in service need be 
discharged. 

Combat neuroses Wilkinson et al. (151) attribute the prolonged latent 
period of combat neuroses to the tendency of the soldier, for reasons of 
pride or group morale, to deny his anxiety and fatigue until he has reached 
the breaking point. Futterman & Pumpian-Mindlin (44) hold that the high 
frequency of traumatic war neuroses among rear echelon military personnel 
and immobilized combatants occurs because of their inability to discharge 
through motor activity the emotional reactions elicited by warfare. In other 
cases, war neuroses often involve guilt about assaulting or killing defenseless 
enemy personnel. 

Reactions of prisoners of war.—Nardini, (94) on the basis of his personal 
observations during three and one-half years as a prisoner-of-war in the 
Philippines and Japan, states that survival depended upon the following fac- 
tors: strong motivation for life with courageous exertion of will, good general 
intelligence, a hardy constitution, the cultivation of emotional indifference 
or a well-managed and balanced sensitivity, a sense of humor, the mainte- 
nance of obligations to others, a controlled fantasy life, successful active or 
passive resistance to the captors, pragmatic opportunism, the self-discipline 
retained from preceding years in the armed services, and plenty of luck. 

Military psychoses—Ripley & Wolf (116), who studied schizophrenic 
reactions in combat areas, conclude that some schizophrenic individuals 
functioned well in the Army despite psychopathologic symptoms for long 
periods prior to their acute episodes. 

Military therapy—Peterson & Chambers (102) state that the aims of 
military psychotherapy in the armed services are not different from those in 
civilian life, but that methods and approaches must be realistically adapted 
to the military situation. Chapman (20) outlines some of the special problems 
of psychiatric practice in a Naval Recruit Training Center, among which 
are the short period available for diagnosis and therapy, the difficulty of es- 
tablishing rapport because of differences in rank between physician and 
patient, the absence of ancillary supporting figures such as the patient’s 
family or friends, and the undeniable secondary gain of the neurosis. 
Grant (48) describes the methods employed in retraining individuals who 
would otherwise be discharged because of unsuitability: the patient’s mal- 
adaptations to military regulations and relationships are sympathetically 
explored, but every effort is made to foster growth and maturity in an at- 
mosphere of understanding; the inductee is taught to conform with certain 
basic rules of conduct by the use of moderate discipline when necessary, but 
individual privileges and group status are concurrently employed as in- 
centives. By such means 59 per cent of 4,800 Navy, Marine, and Coast 
Guard personnel confined at a retraining command were returned to active 
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duty, whereas without such treatment all would have been separated on 
grounds of unsuitability. Wolfe (159), for similar reasons, urges greater use 
of group psychotherapy in the treatment of Naval personnel. However, 
Peterson & Chambers (102) and Brill & Beebe (16) point out that the 
burden of guilt that a soldier feels when evacuated from combat for neurotic 
reasons may be additionally disruptive, whereas a discharge for psycho- 
neurosis may seriously impair the individual's ability to readjust in civil 
life. 

Postcombat.—Noble et al. (95) found a high incidence of psychiatric 
disability on the orthopedic and plastic surgical wards of a military hospital 
and suggest that the recognition and early handling of the emotional prob- 
lems of wounded men can facilitate surgical management and reduce the 
possibility of later psychiatric illness. Schwartz & Inwood (126) found that 
65 per cent of the psychiatric casualties evacuated from Korea to the 
United States were returned to duty in this country and therefore express 
optimism regarding the results of psychiatric treatment based on principles 
learned in World War II. 

Civil defense——Drayer (28) states that the principles of military psychi- 
atry are equally applicable to civilian defense, but believes that it is prema- 
ture to talk of psychiatric methods in this field when the civil defense pro- 
gram is as yet so inadequately conceived and its chance of being effective so 
pitifully small. 


THE PSYCHOSES 


For theoretical and speculative discussions of the psychoses the reader is 
again referred to the reviews listed in the introductory paragraph of this 
chapter. In the remaining sections we shall confine our discussion to literature 
dealing with (a) ‘‘process’’ research on the etiology, nature, and clinical ex- 
pression of the protean abnormalities of behavior usually termed psychotic, 
and (6) clinical investigations of the techniques and results of various meth- 
ods of therapy. 


PROCESS ORIENTED RESEARCH 


This year’s experimental work in psychopathology ranges from tradi- 
tional univariate investigations (145) to those which attempt to incorporate 
personal values into perceptual and reactive functions. Five studies in the 
latter area deserve special mention because they represent newer orientations 
to research in abnormal psychology. 

Rausch (110) and Sanders & Pacht (122), investigating the effects of the 
psychotic process on the perception of size, report that patients tended to 
overestimate the size of the stimulus objects, a phenomenon attributed in 
both instances to the effects of compensation. However, Rausch (110) 
speaks of ‘compensating for threats to the stability of the individual's 
ego structure’ and Sanders & Pacht (122) speak of compensation ‘‘for the 
decrease in the visual angle,” a postulate not as yet integrated with the 
concept of ‘‘ego structure.” These studies, however, point to the possibility 
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of developing correlations between perception of size and the more complex 
patterns ordinarily subsumed under “‘ego structure.’’ Also in the area of 
perceptual functions, Roseman (119) and McGinnies & Adornetto (78) 
interpret their results in terms of the dynamics of other personality vari- 
ables. 

If the recent work by Postman, Bronson & Gropper (103) on the manipu- 
lation of recognition thresholds of taboo words can be generalized to other 
perceptual phenomena, it would appear that the problem of personality or- 
ganization resolves itself into problems of set or attention. This is in accord 
with the suggested orientations of Shakow (129) and Hebb (51) and with the 
expectancy theories of Postman et al. (103). In this context one should also 
note Huston & Senf’s (57) study of attention disturbances in schizophrenia 
and depression. 

Pascal & Swensen’s (97) work on the learning process in mentally ill 
patients supports Shakow’s (129) position that schizophrenia may produce 
impairment in performance, but not in ability or capacity. Other investiga- 
tors (25, 27, 31, 78) this year have also reported that the performance of 
psychotic patients is more variable than the performance of the normal 
controls and that this variability reflects the operation of selective percep- 
tual mechanisms consistent with the organization of the patient but not 
necessarily with the stimulus situation. If this is true it would seem reason- 
able to postulate a specific form of schizophrenic organization rather than 
disorganization. Studies such as those of Eriksen & Lazarus (30) and Laffal 
(72) contain suggestions as to how this dynamic structure of the psychoses 
may be investigated by selected methods specially attuned to the pathologic 
process. 

Factor analytic studies —Another trend this year in experimental psycho- 
pathology has been the increasing use of factor analytic techniques. For 
example, Eysenck’s (31, 32) work points to a productive application of objec- 
tive laboratory methods to the problems of differential diagnosis, with only a 
minimal attempt to define or specify the psychological properties of psycho- 
sis. However, in Eysenck’s discussions, as in those of Crown (26) and of 
Pullen & Stagner (107), the performance of patients on assigned tasks is not 
correlated with the dynamics of their psychoses and evidence supporting 
the hypothesis of differential specificity is given a great deal of attention 
while contradictory evidence is given too little consideration. This is particu- 
larly true of Pullen & Stagner’s study in which eight of nine tests are ex- 
plicitly designed to measure rigidity and the ninth to measure intelligence. 
After the correlation matrix was rotated, a rigidity factor and an intelligence 
factor were extracted without allowance for the fact that four of the tests 
which before rotation were supposed to measure rigidity were later inter- 
preted to tap intelligence. While it is true that the rigidity tests had loadings 
similar to the Wechsler-Bellevue comprehension, arithmetic, and similarities 
tests, it is not clear as to why flicker-fusion overlap is taken as a measure 
of intelligence. Even were such a relationship to be based on Halstead’s 
(50) thesis of biological capacity, the correlations between biological and 
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psychological intelligence would still remain unclear, and the extent to which 
these factors generalize beyond the specific tests employed would remain 
undetermined. 

Projective-—Ritter & Eron (117) attempted to set up statistical criteria 
to differentiate normal from other populations by the use of the Thematic 
Apperception Test. The authors found that when formal and content vari- 
ations were combined in a single score the nonnormal populations were 
clearly differentiated from the normal on the basis of deviant responses, 
although certain normal responses occurred in both groups. Mussen & Krauss 

| (93) employed the Szondi test on a group of students who had been diag- 
nosed neurotic in a university neuropsychiatric clinic, as contrasted with a 
\control group who had never felt the need for psychiatric help; despite Susan 
|Deri’s claims they found the test of little diagnostic or prognostic value. 
McKinnon’s (80) concluding sentences are applicable in this context. 
Knowledge is the result of playing with what we know, that is, with our facts. A 
knowledgeable person in science is not, as we are often wont to think, merely one who 
has an accumulation of facts, but rather one who has the capacity to have sport with 
what he knows, giving creative rein to his fancy in changing his world of phenomenal 
appearance into a world of scientific constructs (p. 145). 


TRENDS AND OMISSIONS 


Two additional lines of thinking and experimentation have been devel- 
oped on the thesis that the functions which most consistently differentiate 
psychotic individuals from nonpsychotics are primarily intellective or cogni- 
tive ones. These have been (a) our rapidly expanding knowledge of neuro- 
physiology, particularly the work reported this year by Penfield (100), 
Jasper (60), and Magoun (81) and (6) a renewed interest in neuropsychologic 
correlations. Kubie (71), for instance, attempts to integrate psychoanalytic 
constructs with neurophysiological processes, but admits that such formula- 
tions remain essentially vague as to the relationship of motivation, memory, 
\and symbolism to neural organizations. Malmo et al. (83) also apply Hebbian 
principles of neuropsychologic synthesis to the therapy of a case of hysterical 
deafness. While it is true that this kind of theorizing has not as yet led to 
ldefinitive formulations, the concepts of cell assemblies, dynamic system 
families, neural feedback, resonancies, etc., may eventually be corre- 
jlated meaningfully with determining tendencies, sets, expectancies, and other 
‘terms relevant to the molar dynamics of behavior. 


PsycCHOLoGic Factors IN DIAGNOSIS AND THERAPY 


Diagnosis—Comprehensive reviews on personality questionnaires by 
Ellis (29), on methodology by Berg (11), and on prognosis by Windle (155) 
highlight the difficulties and the disheartening results obtained from the use 
of standard clinical tools in attempts to sharpen differential diagnostic 
criteria. This year’s literature, in general, confirms White’s (148) observation 
that psychologists are losing interest in the re-examination of the classical 
nosological categories. For instance, the studies of Geist (46), Wittenborn & 
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Weiss (158), and Cohen (21) indicate that although various factors may be 
rotated out of correlation tables of ratings or test scores, they have relatively 
little relationship to the commonly accepted criteria for classifying a par- 
ticular patient in one or another of the currently accepted categories. True, 
Guertin (49), in an analysis of symptom ratings, extracted three factors 
labeled paranoia, hebephrenia, and withdrawal; however, Wittenborn & 
Bailey (156) and Wittenborn & Weiss (158), using essentially the same tech- 
niques, did not distinguish these factors and were unable to differentiate 
involution psychosis or manic depressive manics from other psychiatric 
groups. Mehlman (89) and Forer et al. (36), with special regard to the Ror- 
schach, likewise present data which further justify general dissatisfaction as 
to meaningfulness of current psychiatric diagnosis. In many cases behavior 
changes may be observed to vanish in the protocols of the tests, either be- 
cause: (a) the underlying structure of various psychoses is roughly the same 
and is so reflected in the diagnostic instruments or, (b) these instruments 
are not sufficiently sensitive to discriminate different processes. The latter 
is true in the studies of Winder (154)and Rausch (110) in which they were 
able to distinguish between paranoid and nonparanoid patients. Similarly, 
Theisen (142) employed five pathological Rorschach patterns supposed to 
discriminate normal subjects statistically from schizophrenic patients, yet 
found that 51.6 per cent of schizophrenic patients showed no pathological 
signs. In view of such large lacunae in the diagnostic sieve, the relationship 
of psychotic processes to Rorschach patterns remains questionable. 

Another study this year substantiates Siipola’s (132) observations that 
subjects respond primarily to the interaction of characteristics within the 
stimulus objects rather than being affected by isolated factors; Sanders & 
Pacht (122) point out that the interaction among variables (i.e., color se- 
quence interactions) rather than the variables taken singly produces signifi- 
cant results in distinguishing between normal and schizophrenic patients. 

Evaluation—A survey of the year’s work on evaluation and treatment; 
indicates: (a) that our standard psychological tests do not show improvement 
in psychological structure after therapy despite clinically favorable changes; 
(b) that in the majority of instances the criteria used to assay therapeuti¢ 
progress have not been remarkably successful in differentiating ae 
groups or in predicting behavior; and (c) that there is increasing need o 
restudying past criteria by methods of cross validation and validity generali- 
zation. Appeals in the latter regard are forcefully made by Windle (155), 
and by Auld & Eron (6) who failed to confirm clinical predictions derived 
from the formula developed by Kotov & Meadow (69). 

The testing programs conducted in psychiatric institutions on the com- 
parative effects of various therapies present essentially the same picture 
of indeterminacy. Jones & Peters (63) conclude that tests which are more 
sensitive to overt behavior detect alterations after therapy which are indica- 
tive of improvement, whereas isolated Rorschach variables show no signifi- 
cant changes in basic personality processes. Because of a growing dissatis- 
faction with standard psychologic tests for the evaluation of psychotherapy, | 
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\ there seems to be an increasing use of behavior rating scales for this purpose. 
Wittenborn and his co-workers (157), Guertin (49), and Rackow et al. (108) 
exemplify this trend, but Kubie (70), in his discussion of the values and uses 
of rating scales points out quite forcefully that the external behavior of the 
patient may tell us relatively little about his underlying personality structure, 
a position illustrative of Adolph Meyer’s definition of psychoanalytic theory 
as “the cult of the something else.’’ Nevertheless in the present state of 
our knowledge it appears that information provided by relatively objective 
rating scales must continue to be given proper heuristic weight in much of 
our clinical research. 

Prognosis——Holzberg & Belmont (55) found that out of 45 predicted 
common factors in the Rorschach and the Wechsler-Bellevue, only four pre- 
dictions were supported at a statistically significant level and two merely 
approached significance. Similarly, Filmer-Bennett (35) and Rogers & 
Hammond (118) were unable to find definite prognostic signs in their Ror- 
schach results. In contrast, Stotsky (137) reports reliable findings with both 
the Wechsler-Bellevue and the Rorschach. Freeman (40) tested the Mala- 
mud-Sands Rating Scale as a prognostic indicator of the effects of lobotomy 
and concluded that those patients likely to benefit from other forms of 
treatment were also favorable candidates for the lobotomy operation. Simi- 
lar conclusions were drawn by Feldman (33) in his use of the profiles from 
the Minnesota Multiphasic Personality Inventory for the prognosis and eval- 
uation of shock therapies, again indicating the possibility of utilizing such 
findings as a more reasonable basis for developing a treatment oriented 
nosology than the descriptive diagnostic categories now employed. However, 
in his review of the field, Windle (155) emphasizes the necessity for careful 
cross validation of such prognostic signs before accepting the findings of any 
particular study and suggests the organization of a general psychological 
research exchange which would facilitate coordination in this area of research. 


SocraL EVALUATIONS 


Etiology—The year produced few detailed investigations of the social 
psychology of the psychosis in relation to their treatment. Mark (84) studied 
the attitudes of the mothers of male schizophrenic patients and Redlich e¢ al. 
(111) conducted a demographic study in this field, but no specific conclusions 
were derived as to the characteristics or the functions of social groups in rela- 
tion to individual conduct. 

Social therapy— Caudill et al. (19) report the experiences of a participant 
observer on a ward of neurotic patients undergoing treatment and stress the 
importance of group therapeutic orientation. Rioch & Stanton (115) also 
discuss various factors in milieu therapy and support Sewall, Gillin & LeBar 
(128) in their appeal for an analysis of the impact of the hospital on the 
patient. It has been a widely held belief that institutionalization has a dele- 
terious effect on the ability of many patients to make adequate adjustments 
outside the hospital, yet few attempts have been made to understand the 
specific intramural factors that produce these effects. Recent reports by 
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Galioni, Adams & Tallman (45), Sines, Lucero & Kamman (133), and 
Miller & Clancy (91) on improvements in hospital attitudes and regimen 
confirm their favorable effects on the behavior of many patients, but again 
such correlations remain impressionistic. It is in the relatively controlled 
microcosm of the hospital that such reciprocal relationships between the 
individual and his social environment can be studied in psychological depth, 
configuration, and detail; the knowledge so derived may perhaps then be 
applied to urgent problems in extramural human affairs. 


no 
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THEORY AND TECHNIQUES OF ASSESSMENT?” 


By E. LowE.it KELLY 
Department of Psychology, University of Michigan, Ann Arbor, Michigan 


Many readers will be surprised to discover that this volume of the 
Annual Review contains no chapter on psychodiagnostics, and that this 
topic, together with the diagnostic aspects of counseling and the methodo- 
logical aspects of personnel selection, have all been treated in a single chap- 
ter. The writer accepted the challenge of treating these formerly separate 
topics in a single chapter because of a conviction that although the essential 
problems confronting these several applied fields are the same, each has 
suffered in its development by the lack of acquaintance of workers in one 
with the orientation and methods of the others. 

Whether the applied psychologist functions in the clinical, counseling, 
or industrial personnel field, he is faced with the necessity of evaluating 
individual subjects (patients, clients, or applicants) and characterizing 
them (by categories, types, test scores, profiles, or verbal personality 
formulations) for the purpose of doing something, saying something, or 
implying differential prediction of the future behavior of the subjects in 
another domain (adjustment, performance, etc.). In general, it would seem 
that the explicit recognition of this total assessment process has been 
greater among personnel psychologists than among clinicians, many of whom 
delimit their diagnostic function to making a “‘personality study” for the 
purpose of describing the personality dynamics of the patient or ‘‘arriving 
at a better understanding’’ of him. Presumably, however, the psychologist 
(or someone else who reads his report) utilizes the personality evaluation 
to make explicit or implicit predictions regarding the patient; in other words, 
the assessment is used in a manner which makes a difference in the disposi- 
tion of the case or the treatment of the patient. Diagnosis that is not also 
prognostic would hardly seem defensible. 

While personnel psychologists seem to have been somewhat more sys- 
tematic in carrying assessment research through to its logical conclusion 
by testing their predictions against life criteria, they seem to have been far 
less imaginative and creative in the construction of new types of tests and 
test items which might add measurably to the accuracy of their predictions. 
Clinicians and personality psychologists on the other hand, while more 


1 This summary is based primarily on references published between June 1952 
and May 1953, with an occasional reference to earlier background studies. Because of 
the scope of the topic, the reviewer has been able to refer to less than a third of the 
articles and books which appeared during the year. Inevitably, some readers (and 
many authors!) are likely to feel that essential references have been omitted. 

? The following abbreviations were used in this review: MMPI (Minnesota Multi- 
phasic Personality Inventory); TAT (Thematic Apperception Test). 


281 








282 KELLY 


creative with respect to novel hunches for test construction, seem to have 
been remarkably negligent in the utilization of rigorous procedures to sepa- 
rate good hunches from bad. 

In spite of the broad array of content encompassed in this review, it 
may serve to encourage varied groups of applied psychologists to look at 
their old problems with new perspectives and perhaps to read a few refer- 
ences which they otherwise might have overlooked. 

Since most published papers relating to assessment are concerned with 
relatively specific problems of evaluation or prediction encountered by their 
authors in day-to-day applied duties, it is difficult to discern the major the- 
oretical and methodological issues. Work in this field is in no sense system- 
atic or clearly delineable with respect to either theories or methods. And, 
although all assessment psychologists are in general agreement that predic- 
tion of a selected sample of future behavior (the criterion) must be based on 
selected samples of behavior, this is about as far as the agreement goes. 
On other issues: what theory?, what variables?, what techniques?, how to 
combine the data?, etc., there are wide differences of opinion and practice. 


GENERAL CONTRIBUTIONS 


Without doubt, one of the most valuable general publications in the field 
of assessment is Buros’ (18) Yearbook of Mental Measurement, listing 793 
tests and containing 596 original test reviews. The objectives of the test re- 
view section remain the same as those announced for the third Yearbook 
published in 1940. One of these eight objectives is ‘‘To impel authors and 
publishers to place fewer but better tests on the market and to provide test 
users with detailed and accurate information on the construction, valida- 
tion, uses, and limitations of their tests at the time they are first placed on 
the market.’’ While it is impossible to evaluate the degree to which the 
successive Yearbooks fulfill this or its other worthy objectives, this reviewer 
believes they serve a series of most useful functions and hopes that Buros 
will find it possible to continue this heroic undertaking. He would further 
hope, however, that before too long, the editor might find his task made 
easier by a marked reduction in the number of different tests which need 
to be included. Perhaps it is asking too much to suggest that the author and 
publisher of a test declare it obsolete, but the idea does not seem without 
merit, especially since the profession seems loathe to establish any procedure 
for “certifying” the quality of tests and other assessment techniques. 
Surely, there must be some way to get rid of old tests as well as to encourage 
higher standards in the development of new ones. 

Perhaps, the job of cleaning out the test attic will be speeded up by the 
important work of the American Psychological Association Committee on 
Test Standards. In its ‘Technical Recommendations for Psychological 
Tests and Diagnostic Techniques”’ (3) this Committee attempts to develop 
standards, not for tests, but for test manuals. These proposed standards 
are grouped in three levels: ‘‘Essential,”’ ‘‘Very Desirable,”’ and “‘ Desirable.” 
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It is the writer’s impression that, even if judged by the minimal standards 
recommended by this Committee, relatively few of the most widely used 
tests would be judged ready for publication and use. 

Of especial relevance to assessment are the Committee’s distinctions 
among four kinds of test validity: predictive, status, content, and congruent. 
“Predictive validity”’ is defined as correlation of test and subsequent cri- 
terion measures, ‘‘status validity” as correlation of test and concurrent ex- 
ternal criteria. ‘‘Content validity’”’ refers to the degree to which the test 
samples the universe of content specified, as in an achievement test. Finally, 
“congruent validity” is defined as demonstration, by any one of a variety 
of procedures, that the test actually measures the psychological variable it 
is alleged to measure. The familiar example is that of showing a substantial 
correlation with other measures of accepted validity, e.g., validating a new 
group test of intelligence against Binet I. Q.’s. 

Much more difficult is the task of demonstrating the validity of a test 
designed to measure a hypothetical construct. Here, the task is that of 
demonstrating that scores on the test correctly predict behavior differentials 
deduced from the same theory that led to the construct and to the test; i.e., 
“the validation process is much the same as that involved in evaluating a 
theory itself.’’ This type of validation is well exemplified by the program of 
research on the Blacky Pictures, originally published by Blum as a research 
tool rather than a diagnostic technique. In their recent paper, Blum & 
Hunt (15) review a dozen studies relating performance on the Blacky test 
to a wide variety of behavioral criteria selected on the basis of deductions 
from psychoanalytic theory. All findings seem to point to the probable 
validity of the scores for at least certain of the ‘‘psychosexual dimensions.” 
Further, the findings indicate that these are evaluated with sufficient re- 
liability to discriminate between groups of subjects. As yet, little is known 
about the reliability of these measures for individual subjects or their use- 
fulness as a tool in the assessment of individuals for the prediction of their 
future performance. 

In assessment, psychologists are necessarily concerned with predictive 
validity of a technique for the specific criterion. If a technique has been 
shown to have status validity for group differentiation relevant to the 
criterion, or if there is good evidence of its having congruent validity for a 
relevant variable, this makes the technique promising as a predictor. The 
proof, however, lies in putting it to an actual test in the assessment situation 
and correlating it with the relevant criterion. 

The possible hazard of using a technique in assessment by extrapolating 
too far solely on the basis of data regarding congruent validity is illustrated 
in a paper by Ellis (31). For an individual subject, using only the Blacky 
protocol (administered after 200 hours of analytic treatment), clinical 
psychologists attempted to make a number of inferences about the per- 
sonality and adjustment of the patient. The judges showed reasonably high 
agreement among themselves but not with the criterion judgments of the 
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therapist and patient. Apart from any limitation of this particular study, 
it seems clear that congruent validity is no guarantee of predictive validity 
in the assessment of individuals. 

Reliance on congruent validity alone as the basis for the selection of 
assessment techniques appears hazardous in another respect: validity is 
usually regarded as demonstrated when it can be shown that scores or 
ratings differentiate two known groups at the .01 level of significance. Now 
if the groups are large enough, or if they are extreme enough with respect 
to the criterion dimension, one can achieve a P of .01 and still account for 
precious little variance in the criterion. The assessment psychologist is 
obligated to evaluate techniques on the basis of the amount of variance in 
the criterion predictable from a given predictor. Thus, it is entirely possible 
that a test which has been shown to measure an attribute or variable 
essential for a given prediction, may prove to be useless in actual assessment 
because the test measures so little of the variable or attribute. Even if the 
test scores are sufficiently saturated with the variable to yield significant 
validity coefficients against an external criterion, the assessment psycholo- 
gist is still obliged to ask whether another test will yield even higher validity 
or, if no higher, if it is more efficient in terms of cost, professional time, and 
convenience to the subjects. 


THE EMPIRICAL VERSUS THEORETICAL CONTINUUM 


In general, it appears that the most valid assessment procedures devel- 
oped thus far by psychologists are those in which the investigator has (a) 
selected a criterion and (b) searched widely and diligently for behavior 
samples (tests) which elicit differential responses correlated with the criterion 
measure. Scoring keys thus derived are cross-validated on a new sample and 
the test is ready for use. This empirical approach has been shown to work 
in a variety of assessment situations. It seems likely that such empirically 
derived instruments will be necessary for a long time, that is, until we know 
far more about personality and its evaluation. 

The limitations of the extreme empirical approach are well recognized 
and serious. It leads to techniques with an unknown but probably limited 
degree of generality. The validity of the scoring key or regression weights 
can be expected to decrease or even to disappear if used with subjects differ- 
ing in certain respects from those on whom the weights were developed or 
or if it were used to predict a related but different criterion. Its usefulness 
must therefore be empirically tested anew in each situation, and even in 
the same situation with changes over time. Worse still such empirically 
derived instruments are not likely to make systematic contributions to 
personality theory. They do not permit ready identification of the variables 
or dimensions of personality relevant to effective performance on the 
criterion and hence do not, except in a very general way, add to the psycholo- 
gist’s ability to come up with fruitful hypotheses for use in the next assess- 
ment situation. 
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At the other end of this continuum, we find the theory or technique 
oriented assessment psychologist, who because of his particular orientation, 
is convinced that one or more personality variables are salient in determining 
the behavior of persons in a wide variety of situations. Any specific assess- 
ment problem is, therefore, seen largely as that of evaluating these assumed 
“key” personality variables in each subject and then predicting, on an a 
priori basis, each subject’s performance in the criterion situation. Such 
predictions may or may not be subjected to empirical test, but even when 
they are, this investigator is likely to be satisfied with a demonstration 
that the key variable or variables is significantly related to the criterion. 
Having proved to his satisfaction the correctness of his theoretical deduction, 
he feels free to use both his theory and his technique in a variety of assess- 
ment situations. It is rare for the investigator to show the slightest concern 
as to whether his favorite assessment variable accounts for more than the 
tiniest fraction of the variance in the criterion, and he is likely to be blissfully 
unaware of the above noted hazards of generalizing on the basis of a demon- 
strated correlation between a predictor and a specific criterion. 

In between the empirical approach and that of emphasizing particular 
variables, or favorite techniques, are the systematic research programs of 
Cattell, Eysenck, and Thurstone directed toward the identification and 
measurement of the ‘“primary”’ variables of personality. In spite of differ- 
ences in the details of their work, all of these investigators are convinced 
that further progress in the evaluation of personality and the prediction of 
behavior demands, first, an orderly taxonomy of the personality domain 
and, second, the development of objective measures of the variables which 
emerge. All are sampling and measuring a large number of variables and all 
are using factor analysis as a means of testing the degree of overlap or 
independence of the variables which they are measuring. Cattell & Wenig 
(22) even go so far as to invade the domain of projective tests with the 
factor analytic approach. Eysenck, in his book The Scientific Study of Per- 
sonality (33) unabashedly argues for not only the desirability but the 
absolute necessity of developing a firm taxonomy of personality as a basis 
for further progress in psychology. In contrast to the more widely accepted 
organismic orientation among most clinical psychologists, Eysenck takes an 
outspokenly atomistic, elementaristic point of view and is well along on an 
elaborate program of research directed at the classification, identification, 
and objective measurement of personality types and traits. 

In this reviewer’s opinion, Eysenck’s book was easily the most impor- 
tant publication in the entire field of psychology during the past year. It 
presents important experimental findings on a large number of controversial 
issues in personality theory and psychopathology; furthermore, the findings 
point to conclusions considerably at variance with current teaching: e.g., 
“‘psychoticism” and “‘neuroticism’’ are independent dimensions of person- 
ality; both are continuous variables in normals as well as pathological sub- 
jects; neuroticism, as objectively measured, is apparently as much a genetic 
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factor as general intelligence! Whether the reader agrees or disagrees with the 
interpretations, he simply cannot afford to overlook Eysenck’s results. 

That personality variables: identified by the factor analytic process 
may have predictive validity for a wide range of criteria is strongly sug- 
gested by Barron’s paper (5) on the Complexity-Simplicity dimension of 
personality. This factor independently identified by two investigators, using 
different tests, appears to correlate with a great many socially relevant be- 
haviors. 

Gough’s work during the last five years, leading to the development of the 
California Psychological Inventory (51) represents a nice blend of the 
theoretical and empirical approach to assessment. Although guided by the 
goal of identifying and measuring personality variables of general predictive 
value, Gough has also developed a number of empirically derived scales for 
the inventory designed to predict a variety of criteria; e.g., social participa- 
tion, delinquency, and academic achievement. It is of interest to note that 
different scales are necessary for the prediction of academic achievement in 
high school, in college, and in graduate schools. This suggests either that dif- 
ferent motivational variables are operative or that they interact in different 
ways to influence academic achievement at the three educational levels. 


THE ROLE OF THE PSYCHOLOGIST IN ASSESSMENT 


In perhaps no other respect do psychologists differ so completely as in 
their answer to the question of the appropriate role of the psychologist in 
assessment. In the absence of objective tests of personality, psychologists 
have been forced to rely heavily on the human observer to discriminate 
essential personality variables and ‘‘to rate’’ subjects on these variables. 
Furthermore in the development of projective methods, the human inter- 
preter came to be regarded as essential for the proper evaluation, integration, 
and interpretation of the behavior sampled by these techniques. 

Even before the days of psychological tests and projective techniques, 
the practical problems of assessment were for the most part met by the 
selection or diagnostic interview in which a human being plays the role of 
the test, scoring key, and regression equation all rolled into one. The authors 
of Assessment of Men (90) state baldly that the interview is probably the 
best and only indispensable method of assessment. Rodger (91) goes as far as 
to say “the interview is the standard means whereby people are judged 
for many purposes, and it is likely to remain so, maybe till the end of time.” 
In an interesting paper entitled ‘“‘The Worthwhileness of the Interview,” 
Rodger admits the generally low reliabilities and validities of interview 
assessment found in most studies, but takes the curious position that none 
of the studies proves its uselessness. He defends the interview on the basis of 
its “‘acceptability”’ and calls on psychologists to improve rather than aban- 
don the technique. Crissy (26) also emphasizes the crucial need for research 
on the selection interview. His paper includes a good discussion of the 
inherent difficulties in such research and methods of meeting them. 
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Rodger’s defense of the interview seems to have been brought about 
largely by the findings of the Michigan project (71) in which the interview 
did not contribute essentially to assessment. He appropriately notes a num- 
ber of features of the research design which could have contributed to low 
validities found for the interview. In the meantime, Holt & Luborsky (67) 
have published a second interim report on the Menninger Assessment Pro- 
gram which relied most heavily on judgments based on the interviews and 
a battery of individually administered psychological tests. In this project, 
each applicant was interviewed independently by three psychiatrists, each 
of whom made a global prediction regarding the candidate’s probable success 
in psychiatric training. Note that in this study the interviews were conducted 
by staff psychiatrists, presumably experts in the art of interviewing and 
maximally familiar with the expectations and demands of the criterion 
situation. Even so, the validities of the judgments based on these interviews 
are extremely low: for 14 interviewers who rated from 9 to 98 applicants, 
the median validity is only .08. 

In spite of accumulating evidence of this sort, the interview seems to 
have no serious competition as the most popular assessment technique. 
Swenson & Lindgren (111), in a survey of Minnesota industries, found it 
to rank first among personnel selection procedures. Crissy (26) also notes 
that it continues to be the most widely used personnel selection method in 
private industry. Stalnaker & Eindhoven (107), in a survey of medical 
school admission requirements, reports that applicants are required to report 
for a selection interview in 53 of 80 medical schools. 

Although the interview is the primary diagnostic technique employed 
by psychiatrists, there are very few published studies bearing on its pre- 
dictive validity. With respect to reliability, the evidence from older studies 
suggests inter-interviewer agreement is likely to be considerably better 
than chance but still relatively low. Since in most instances no external 
criteria are available, no estimate of predictive validity is possible, and 
““correctness” is assumed on the basis of consensus in interviewer judgments. 

Two newer studies provide slight evidence on this point. In a study of 
the relationship among several measures of anxiety, Gleser & Ulett (49) 
report that the ratings of psychiatrists of ‘‘anxiety proneness” correlate .55 
with scores on the Saslow Screening Inventory and .54 with the ratings of 
clinical psychologists based on a battery of individual tests. Even here, 
however, it was of interest to note that the objective measure correlates 
as high with each set of ratings as did the ratings of the psychiatrists and 
psychologists with each other. Furthermore, since the objective scores on 
the Saslow Inventory correlate higher with the average of the two ratings 
than with either set of ratings, it must be concluded that the Inventory is a 
better measure of whatever the judges were rating than either set of judges. 

The other study involving a check of psychiatrist-interview judgments 
against outside criteria is not so encouraging. Eysenck (33) reports an 
unpublished doctoral dissertation by Heron which involved an assessment of 
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factory workers and an analysis of the relationship between personality 
variables, productivity, and work adjustment. In this study, the psychia- 
trist’s rating of ‘“‘mental health” was found to be essentially uncorrelated 
with any of the other variables. (As a matter of fact, its low correlation with 
“neuroticism,” objectively measured, was in the wrong direction!) Eysenck 
attributes these unexpected findings to the fact that the psychiatrist was 
forced to make his evaluations under unfamiliar conditions, e.g., the subjects 
were asked to come in for the interview and the psychiatrist did not have 
access to any records of the subject’s work history. 

The role of the psychologist in the assessment process is even more 
important for the proponents of ‘“‘organismic assessment,” who assume that 
“the trained psychologist or psychiatrist, with a fund of additional facts 
at his disposal is, today, capable of improving to a significant degree the 
accuracy of mechanical predictions derived from test scores alone’ (90, 
p. 52). Although the blunt statement of this position is rarely encountered 
in print, it appears to be an implicit belief on the part of most users of diag- 
nostic and selection interviews, projective techniques, situation tests, and 
other procedures which depend on the human interpreter or integrator of 
data. 

The curious state of affairs wherein the most widely (and confidently) 
used techniques are those for which there is little or no evidence of predictive 
validity is indeed a phenomenon appropriate for study by social psycholo- 
gists. This reviewer can only assume that in the absence of evidence of their 
predictive validity, such techniques must serve an important function other 
than assessment. Is it that they serve primarily to reduce threats of anxiety 
for persons confronted with the necessity of making significant decisions in 
the lives of individual clients or patients? Lacking dependable validated 
techniques for making the predictions essential to wise decisions, it should 
not be surprising if persons responsible for professional decisions were to 
gravitate to the use of techniques which yield a relatively large amount of 
information concerning the subject, regardless of how irrelevant most of the 
information may be. Add to extensity of information provided by the 
technique a theoretical orientation sufficiently flexible to permit using the 
information in a manner which seems ‘‘to explain” any subject, and we have 
what would seem to be the necessary ingredients for an anxiety-reducing 
prescription! Whether this analysis is correct or not, it cannot be denied that 
in most practical assessment situations, the choice of techniques seems to be 
largely a function of user confidence rather than demonstrated validity. 

The organismic hypothesis and the essentiality of the human being in the 
assessment process is also discussed at considerable length by Eysenck (33) 
in a chapter entitled ‘‘The Organization of Personality.’’ Eysenck reviews 
the evidence from three levels: total assessment; an intermediate level, as in 
the case of Rorschach interpretation along statistical or intuitive lines; and a 
more fundamental level, as in perceptual organization. All of the evidence 
serves to cast serious doubt on the correctness of the organismic position. 
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In any event, the introduction of the human interpreter or integrator in 
the assessment process greatly complicates the problems of validation, 
since individual differences in the human being may contribute to the vari- 
ance of the predictions which emerge. Since, in such situations one cannot 
properly speak of the validity of a technique but only of a “‘technique-user 
combination” it would follow that for any given assessment situation, there 
should be a prior determination of the person-technique-criterion validity of 
each staff member! 

Perhaps, as the result of accumulating evidence concerning the low 
predictive validity of judgments based on projective techniques, most recent 
studies have shown a trend toward the objectification of scoring and the 
utilization of categories of responses as variables. This trend has been viewed 
with alarm by Hanfmann (59), who while admitting there is no good reason 
why projective techniques should not be used objectively, takes the position 
that attempts to improve projective techniques by making them more 
objective ‘‘ . . . may be expected to fail precisely to the extent that they suc- 
ceed.”” She believes that the essence of projective psychology lies in 
‘‘ |. . what the psychologist does with the data obtained and in the nature 
and goal of treatment he accords them.”’ She goes ahead to plead for the use 
of projective data via the technique of “symbolic interpretation” as a 
means of ferreting out ‘‘personal meaning” of the material for the case 
under study and for arriving at new insights. While she does not insist that 
this approach is necessarily superior to others, she believes it is an approach 
worth developing in its own right and argues for ‘‘further study of the inter- 
pretative process.”’ 

With this plea, the present reviewer is in complete accord. In fact, as he 
and Fiske have argued elsewhere: 


It is entirely possible that an extensive investigation would reveal some gifted 
persons who by procedures not presently communicable, can make clinical predic- 
tions superior to those possible by statistical psychometric procedures. If this should 
be the case, it is essential that efforts be made to identify the methods employed by 
such persons so that these methods may be communicated to others in training. If 
their methods cannot be identified or communicated, it would still be worthwhile to 
identify the essential characteristics of such persons so that others similarly gifted 
might be selected to function in a similar manner (71, p. 202). 


However, in the absence of supporting evidence concerning the contri- 
bution of ‘‘understanding”’ to predictive validity, this reviewer cannot con- 
cur with the position taken by Rotter in the Volume 4 of the Annual Review 
of Psychology (94) condoning the choice of assessment techniques on the 
basis of ‘‘best fit—own theoretical orientation and purpose,” to be used as 
‘“‘a personal instrument,” etc. If the interpretative approach, and the feeling 
of ‘‘understanding” engendered by the use of certain techniques is of value, 
there must be some way of demonstrating it other than by “testimonials.” 
If its value cannot be demonstrated, psychologists might well utilize at 
least a part of the time currently expended on techniques of doubtful validity 
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in developing more valid tools for the tasks at hand. All too frequently, the 
feeling of ‘‘understanding”’ in the user of a technique is regarded as adequate 
“validation,” i.e., the technique is said ‘“‘to work.”” Worse still, new genera- 
tions of students are taught to arrive at similar “‘understandings”’ of indi- 
vidual cases with no validity other than consensual judgment, which un- 
fortunately, can be as invalid as the judgment of an individual. If, ‘‘explana- 
tion” is a doubtful criterion for use in personality research as MacKinnon 
suggests in ‘‘Fact and Fancy in Personality Research’”’ (82), it is even more 
doubtful as the proper criterion of validity in a single case! 

While this reviewer doubts that the approach suggested by Hanfmann 
will lead to the development of human beings capable of handling the com- 
plex problems of best weighting data for the prediction of criteria, he is 
quite convinced that the interpretative approach is most promising for the 
development of hypotheses leading to better objective procedures. The 
apparent “understanding” of a single case can lead to many creative hy- 
potheses, each susceptible of empirical test. Since persons most gifted in 
developing intuitive hypotheses are frequently not inclined to feel it neces- 
sary to subject them to rigorous verification, psychologists seeking to de- 
velop new and better objective tests might well look to the clinical literature 
for such worthwhile leads. Thurstone in a recent publication goes so far as 
to state: 


Projective procedures are the nearest approach to personality tests. A projective 
test is essentially an ambiguous presentation to which the subject can respond in a 
variety of ways. In so doing he may reveal his personal idiosyncrasies. The best form 
of projective test is one which is quite unstructured for the subject but fairly well 
structured for the examiner. In this manner a projective test can be objectively 
scored; but the interpretation of the scores is often as unstructured as the test is for 
the subject, and then the test is useless for scientific inquiry (112, p. 1). 


In summary, the role of the human being in the assessment process con- 
tinues to be a much argued question. For some, he is the sine qua non in the 
assessment process. For others, he is the necessary architect, designer, and 
engineer, but to the degree that he enters into the assessment process itself, 
he is merely an added source of error variance! 


GENERAL REVIEWS RELEVANT TO ASSESSMENT 


Although of varying relevance, a number of review articles published 
during the year help to provide a perspective of recent trends and develop- 
ments in assessment methodology. 

Schofield (100) in his third annual review of research in clinical psychol- 
ogy classifies 143 papers into 20 research areas and notes changes over a 
three year period. For 1951, fully 90 per cent of all articles reviewed concern 
one or another aspect of clinical diagnosis and assessment as used in this 
chapter. In Schofield’s opinion, the most significant contribution of the 
year was the series of related papers by Wittenborn and his associates 
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describing the development of an objective, quantitative system for multiple 
psychiatric diagnosis, a procedure which results in a profile of scores on seven 
factors or clusters of symptoms. 

The most extensive review of the year was that done by Windle (120) 
on ‘‘psychological tests in psychopathological prognosis.” Windle analyzed 
90 research studies, organizing the findings on the basis of the particular 
test employed. In a master table covering seven pages of fine print, he 
has abstracted each investigation indicating: the author; the number and 
type of patients involved; the duration of illness; the follow-up time; the 
type of therapy, if any; the prognosis made for each of the prognostic indices 
and the level of significance of the finding. Where the original author did 
not report statistical significance, but did provide the necessary data, Windle 
calculated it. The over-all picture is one which should serve to shake any 
existing complacency regarding the predictive validities of our most widely 
used diagnostic tests. Even more alarming: ‘‘Many of the studies failed to 
demonstrate empirical justification for the conclusions drawn. Further, 
comparison among studies revealed little agreement among findings.” 
Windle emphasizes the need for better research studies in this field and 
notes that most of the published investigations fail to contribute signifi- 
cantly because: (a) they fail to specify the conditions of the experiment, 
e.g., description of the patient population, conditions of therapy and 
criteria of outcome; (b) they fail to deal with homogeneous populations 
and conditions; (c) they fail to report their results in terms amenable to 
statistical evaluation; and (d) the reported findings are only rarely subjected 
to cross-validation. Unfortunately, most of the new studies appearing during 
the past year have likewise been defective in one or more of these aspects. 
Perhaps the most frequent error is the over-optimistic estimates of P 
values growing out of misconceptions and misapplications of the laws of 
probability. In the absence of statistical sophistication and with the motiva- 
tion to ‘“‘prove’’ that a particular test score or index is valid, many investi- 
gators appear all too anxious to rush into print with “‘promising findings.” 

Ellis (32) undertook a summary of all research using personality inven- 
tories published between January 1946 and December 1951. He presentsa 
tabular analysis based on 499 studies with respect to the discriminations 
tested. Although this analysis shows inventory scores to have discriminated 
significantly in over half of the studies, Ellis is generally critical of their 
validity and suggests that where inventories are most effectively used they 
tend to become as time consuming as alternative psychodiagnostic proce- 
dures, such as interview and projective techniques, which to Ellis, ‘‘are 
more clinically incisive and valuable.” Evidence for this latter opinion is 
not presented! 

Ghiselli & Barthol (47) provide a more limited review of the validity of 
personality inventories in the selection of employees as reflected in 113 
studies since 1919. They report that inventories have been found useful in 
a variety of jobs but that there are enough negative results to suggest cau- 
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tion in their use. In general, the validities reported were higher for sales 
clerks and salesmen and lower for supervisors and service workers. 

Levine & Tupes (78) summarize postwar research in pilot selection and 
classification. The pilot selection tests developed during World War II have 
shown surprisingly stable validity over a period of eight years under a 
variety of conditions, their validity being particularly good for the criterion 
of flying deficiency against which these scores were originally validated. 


CRITERION PROBLEMS 


In general, assessment psychologists are becoming increasingly aware of 
the complexity of the criterion problem. Traditionally, they were likely to 
select a criterion on the basis of its availability, its simplicity, or simply to 
take for granted that the most commonly used measure would serve as an 
acceptable criterion. If efforts at predicting the criterion were not successful, 
it was generally supposed that the fault lay in the predictor variables used. 
Gradually, we seem to be discovering that the behavior constituting per- 
formance in the domain to which we wish to predict is fully as complex, 
probably as multidimensional and as difficult to measure as the personality 
variables used as predictors. A variety of papers reflected this trend during 
the previous year. 

Diagnostic categories as criteria.—Following his earlier publication out- 
lining the method of criterion analysis, Eysenck (34) argues for the utiliza- 
tion of socially relevant criterion measures (e.g., psychiatric diagnoses) as 
transitional criteria for the identification of psychological variables which 
can, in turn, be used to sort people into more meaningful diagnostic cate- 
gories and with greater accuracy than is now possible. Lorr (79) presents a 
logical analysis of the essentials of diagnosis, emphasizing the multidimen- 
sional nature of most psychiatric conditions and the desirability of describing 
patients on dimensions which are relatively homogeneous. Du Mas (30) 
in a stimulating paper entitled “‘On the Mathematical Representation of 
a Syndrome” suggests a procedure for selecting and ordering the components 
of a diagnostic profile. That even the older diagnostic categories may have 
some predictive validity is suggested in a paper by Hunt, Wittson & Hunt 
(70) who found that naval personnel labelled with a definite psychiatric 
diagnosis were more likely to receive neuropsychiatric discharges than 
those with less specific diagnoses. 

Wittenborn & Weiss (121) using the quantified multiple diagnostic pro- 
cedure report wide differences among patients diagnosed with manic de- 
pressive psychosis, manic state, and argue that the conventional diagnosis 
obscures important differences among them. Guertin (56) on the basis of a 
factor analytic study of behavioral symptoms identifies six factors in a 
group of patients even though are all classified as schizophrenic. In an 
inverted factor analytic study of a sample of the same group of patients 
Guertin (57) identifies three types of schizophrenic patients corresponding 
essentially to the present typology. The paranoid and hebephrenic types 
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appeared to be the most frequent mixture. He failed to find any evidence 
whatsover of a general factor of schizophrenia. Turning to another diagnostic 
category Lorr & Jenkins (80) report a factor analytic study of the symptoms 
of maladjustment in delinquent children and conclude that there are at least 
five relatively independent factors in the global diagnoses of juvenile 
delinquency. 

Other criterion studies—Related studies from many other fields empha- 
size the multidimensional nature of most criteria. Ryans & Wandt (97) 
report that five factors are necessary to describe the behavior of teachers 
in elementary school; in a related study, Ryans (96) found six factors are 
needed to describe the behavior of secondary school teachers. Rush (95) 
identified at least four factors in sales success and concludes that the attempt 
to predict the global criterion may obscure important relationships between 
the predictor and the true criterion. Severin (103) notes that in the industrial 
field performance during the period of training correlates only about .20 
with later job performance and hence the two cannot be considered as 
equivalent criteria. Brown & Ghiselli (17) also note that tests which predict 
training criteria may have little or no predictive power for the criterion of 
job proficiency or vice versa. Heron (65) finds that even among unskilled 
factory workers it is necessary to differentiate at least two dimensions of 
job success: average productivity and the supervisor’s rating of job adjust- 
ment. Similarly, Fleishman (38) identifies two dimensions of supervisory be- 
havior, each of which can be measured with respectable reliabilities and yet 
correlate —.02. Only if the criterion represents a relatively limited domain 
of behavior does it seem likely that a single criterion measure is adequate 
for assessment study: Gaier (45) reports a single common factor adequate 
to explain variance in the performance of medical school freshmen in five 
courses. Enlargement of the domain of criterion behavior seems, however, 
to lead almost immediately to more dimensions. Thus, Newman, French & 
Bobbitt (89) in two independent analyses of successive classes at the Coast 
Guard Academy find three independent clusters of cadet performance: (a) 
adaptability to academy life and practice, (b) athletic proficiency and atti- 
tudes, and (c) academic grades. 


NEw STUuDIES OF ASSESSMENT 


Only two reports describe new applied programs of assessment. Harrison 
& Jackson (61) describe a program for the placement of young engineers in 
a large organization. On the basis of a battery of objective and projective 
tests a personality description is prepared for each employee. The authors 
report that “‘.. . ninety-one per cent of the supervisors of these men report 
that the evaluation was correct.” Similarly Higham (66) describes the use 
of tailored situation tests and staff judgments ina personnel selection program 
of a British industry. He regrets being unable to report any validity but 
says that the sponsor seems to be satisfied! 

A systematic series of investigations dealing with the one variety of 
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global assessment, Leaderless Group Discussion (LGD), is being conducted 
by Bass and his associates at the University of Louisiana. At least six addi- 
tional studies growing out of this program have appeared during the last 
year. 

The first study, by Bass & Coates (7), reports on the use of the LGD 
technique in forecasting Officer Potential. Subjects were third year ROTC 
cadets, the criteria were two sets of ratings by tactical officers. Reported 
validities were approximately .40 for the Army cadets and .46 for the Air 
Force cadets. Since the subjects studied were already selected for ROTC 
training, the authors suggest that the validity of the technique in screening 
applicants might be even higher. 

Another study (122) deals with the validity of leaderless group discussions 
amongst strangers. Trained judges were found to agree .92 and to predict the 
criterion (sociometric choice for leaders by fraternity members) with an r of 
40. Bass & Wurster (10) direct their attention to the effect of the nature of 
the problem on LGD performance. In still another paper, Bass, Klubeck & 
Wurster (8) find that the validity varies from discussion to discussion even 
though the interjudge agreement remains stable. Validity was also found to 
vary with the participant’s outside leader status, the more stratified status 
the better being the discrimination. In a related study, Bass & Wurster (11) 
reported a correlation of .88 between participant rank in an industrial con- 
cern and his leader behavior in a supposedly leaderless situation. This leads 
to the conclusion that LGD rating are not valid where participants are of 
known different rank and prestige. 

In spite of the demonstrated predictive validity of LGD scores, two ques- 
tions appear relevant. First, do such scores, obtained in this rather eleborate 
procedure, predict the criteria of leadership any better than more economi- 
cally obtained predictors? In one of the studies noted above, the ACE test 
correlated .28 with the criterion and in another a correlation of .54 was re- 
ported between paper-and-pencil predictors and supervisory success. 

A related question concerns the identification of the personality variables 
contributing to the rated leadership. In an earlier study Bass (6) reported 
the amazing correlation of .93 between the ratings of judges and the actual 
number of words spoken by subjects in the LGD situation! In a more recent 
paper, Bass and four coauthors (9) report correlations of .37 and .22 with 
Guilford-Zimmerman Ascendancy and Sociality Scores, and a significantly 
larger number of Rorschach responses for subjects judged as good leaders. 
While it is of theoretical interest to determine the personality variables in 
the subjects which lead judges to rate them high or low on “leadership,” 
one may well ask, why not use measures of these variables to predict the 
criterion directly? Only if it can be demonstrated that the LGD ratings of 
trained observers predict leadership criteria better than more economical 
measures of personality variables would it seem defensible to use the LGD 
procedure in practical assessment situations. 

Also in an effort to identify the major personality variables assessed in 
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situation tests, Sakoda (98) subjected the correlational data published in 
Assessment of Men to a modified cluster analysis. He reports that O.S.S. 
(Office of Strategic Services) ratings were a function of at least two varieties 
of situations in which the candidates were observed: verbal and active. 
In another cluster analysis of ratings on the ten traits, he finds three clusters: 
abilities, interests and motivation, and social adjustment. On the basis of 
these findings, Sakoda argues that a personality trait must be defined in 
terms of context and kind of situation in which behavior occurs. It should 
be noted that this analysis deals with traits as rated, and like all factor stud- 
ies of ratings could reflect, at least in part, perceptual clusters in the minds 
of raters as well as actual clusters in the behavior of subjects being rated. 

In spite of often reported high interjudge agreement in the evaluating of 
subjects by interview or other techniques, it is rare to find such reports 
accompanied by information pointing to the basis for the agreement. 
McKeachie (86) in a controlled experiment found that the presence of lip- 
stick significantly affected certain personality ratings assigned six female 
subjects on the basis of a ten-minute interview. Inquiry revealed that the 
raters were totally unaware of this cue. Grossack (55) reports that first im- 
pressions are influenced both by identifiable cues and by expectations. 
Interjudge agreement can thus result, in part, from common expectations 
among judges. Fensterheim & Tresselt (35) demonstrate a significant influ- 
ence of the value system of the judges on judgments based on photographs. 
Gage (44) reports that the prediction of responses of subjects on the Kuder 
Preference Record are more accurate when made on the basis of stereotype 
(expectations?) than when made after observing subjects in a series of brief 
situation tests. 

This finding suggests the possibility that at least part of the often re- 
ported agreement between supposedly independent judgments may result 
from the fact that judges use a common “‘stereotype”’ as the basis of their 
evaluations. That this is more than a possibility was shown by Malcom 
(83) in an unpublished doctoral dissertation. Using data from the Michigan 
assessment study, and a technique of profile correlation, Malcom computed 
the congruence between sets of ratings assigned each subject by assessment 
staff members on the basis of different techniques. He found that much of the 
“apparent validity” of the rated profiles based on specific techniques was a 
function of a common stereotype of ‘the prospective clinical psychologist.” 
In fact, for certain of the ratings, a ‘‘modal profile” turned out to be more 
predictive of the criterion profile than were the unique evaluations of indi- 
vidual subjects! 

The danger that judgments or ratings may be contaminated or reflect 
spurious cues is dramatically shown by Hemphill & Sechrest (64) in a study 
of the relation of different criteria of aircrew effectiveness in combat over 
Korea. One of the criteria was the actual bombing error objectively meas- 
ured from aerial photographs. In spite of the fact that the reliability of this 
objective criterion was zero, it was found to correlate .80 with the ratings 
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of superiors on “objective achievement.” Further analysis showed that this 
correlation was generated by the fact that the raters each had access to the 
same official bombing records, and, consequently, based their ratings pri- 
marily on shared but unreliable information. This resulted in spuriously 
high interjudge agreement and also spuriously high correlations of the judg- 
ments and the objective but completely unreliable criterion. 

A more direct analysis of the basis of clinical judgment is reported by 
Forer & Tolman (39) who asked 30 clinical psychologists to judge the value 
of each of 100 items in a sentence-completion test. In general, the judges 
assigned higher values to those items which appeared to elicit reactions 
toward specified situations and which seemed to throw light on the causes 
of emotional states. They also report that confidence of ratings was associ- 
ated more with extreme than with moderate judgments. Davenport (28) 
studied the ‘“‘semantic behavior” of six clinical psychologists in the evalua- 
tion of six TAT? records from a heterogeneous group of subjects. Each judge 
was asked to decide whether or not each of 207 typical interpretive state- 
ments applied to each TAT record. To provide an index of the ambiguity 
of the statements, each statement was rated by 26 other clinical psychologists. 
She found (a) relatively little agreement among judges in the differential 
use of the statements for the six TAT records and (5) that the judges tended 
to apply statements rated as universal to almost any patient while avoiding 
the use of more specific statements. Analysis of the statements rated as the 
most ambiguous showed them to be more heavily loaded with psychoanalytic 
terminology. Davenport also found that her subjects rarely made statements 
about positive traits or assets of personality even though some of the TAT 
records were obtained from normal subjects. 

Klieger & Mosel (74), in a study of performance ratings in a training 
division of the army, found no difference in the reliability of the ratings of 
judges indicating greater as compared with those indicating less opportunity 
to observe the specific trait rated. In general, ratings by superiors showed 
higher reliability than ratings by peers, regardless of the degree of familiarity 
claimed. The authors suggest that the higher interjudge agreement among 
superiors may be a function of greater communality of perceptual habits in 
the observation and categorizing of the particular variables rated in this 
situation. Conceivably peer judges might show higher agreement in rating 
other variables, e.g., those not related to behavior in the military situation. 


STUDIES OF PREDICTIVE VALIDITY BY TYPE OF CRITERION 


Relatively few published papers provide evidence on the predictive 
validity of techniques for specific criteria. In this section, we shall review 
those noted, grouped by the type of criterion predicted. 

The course and outcome in psychotherapy.—One of the most important 
and frequent assessment problems confronting the clinical psychologist in 
mental hygiene clinics is that of evaluating patients for critical decisions 
regarding psychotherapy. Several studies dealing with this problem appeared 
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during the past year. All investigators relied almost exclusively on a single 
technique, the individual Rorschach. Some of the studies were concerned 
with predicting continuation-termination of therapy, others used improve- 
ment in therapy as a criterion. 

In 1951 Rogers, Knauss & Hammond (93) published a study showing only 
chance differences in 99 Rorschach “signs” for mental hygiene patients 
staying in therapy for less than five as compared with more than five inter- 
views; they also found no relationship between clinical judgments based on 
Rorschach protocols and termination in therapy. Kotkov & Meadow (75), 
using nine interviews as the criterion of continuation in therapy, analyzed 
the Rorschach records of 98 patients in group psychotherapy and calculated 
discriminant function weights (to eight decimal places and six significant 
figures!) for three out of an unspecified number of Rorschach scores. Applica- 
tion of these weights and a cutting score to a new group of patients in indi- 
vidual therapy resulted in the correct prediction for 69 per cent for 52 cases 
(P=.01). However, for this group of patients one of the three Rorschach 
variables, D%, not only failed to discriminate but the direction of the dif- 
ference was reversed. P values for the other two variables, R and FC-CF, 
were .01 and .02. Parenthetically, it is of interest that these authors criticize 
the use of the “sign method” as violating the Gestalt quality of classical 
Rorschach interpretations but defend the use of the discriminant function 
method and theorize about the meaning of the variables weighted in their 
formula. 

Auld & Eron (4) applied the Kotkov & Meadow formula to 33 patients 
in another clinic and used the same criterion for continuation in therapy, 
nine or more interviews. They found the formula predicted correctly for 
only 52 per cent of their cases, this in spite of the fact that the tetrachoric 
correlation between R and the criterion was .55. In this study, FC-CF (the 
only other Rorschach variable which had held up in Kotkov & Meadow’s 
cross-validation attempt) showed a difference (P<.10) in the wrong direc- 
tion. Furthermore, for 23 of the 33 patients to whom the W-B had been 
given, the correlation between verbal I.Q. and the criterion was .71. Auld 
& Eron conclude by emphasizing the need for cross-validation and validity- 
generalization studies of proposed methods of diagnosis and formulae for 
predicting behavior. 

Rogers & Hammond (92) in another study evaluate the accuracy with 
which outcome in psychotherapy is predictable from the Rorschach. The cri- 
terion groups were 59 patients judged as improved and 50 patients judged 
as unimproved at the termination of therapy. As in their earlier study 
these investigators first used clinical judgments of the total Rorschach 
record. Categorized judgments (successful-unsuccessful) were made by three 
experienced clinical psychologists under four conditions: (a) a snap judgment 
(in two or three minutes); (6) a studied judgment using as much time as 
desired; (c) the same as (b) but with knowledge of the name of the therapist; 
and (d) from knowledge of the therapist only. Although both the intrajudge 
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and interjudge agreement were generally good, none of these global judg- 
ments showed other than chance relationships with the criterion. 

Next, the Rorschach scoring weights previously reported by Harris & 
Christianson (60) were applied. No significant discrimination was found 
between the scores for the two groups of patients. Rogers & Hammond then 
analyzed 99 Rorschach scores or indices in relation to the criterion. All of the 
resulting differences were chance only. Similarly negative findings were 
found for certain a priori combinations of variables. 

Still undaunted, Rogers & Hammond (92) derived a set of empirical 
“‘rules’’ from the data from this group of patients and tested their validity on 
a cross-validation group. They report that: ‘“‘the rule most useful permitted 
prediction in about one third of the cases examined, and showed correct 
predictions at the 1 per cent level of confidence.’’ This reviewer cannot but 
wonder how even their best rule would work for the patients studied by 
Auld & Eron. 

Prognosis, i.e., outcome of treatment of hospitalized neuropsychiatric 
patients was used as the criterion in an investigation reported by Filmer- 
Bennett (36). Using 12 matched pairs of improved and unimproved patients, 
he found significant differences (at the .20 level) for only 9 of 60 Rorschach 
signs. (Twelve of the 60 would have been expected by chance.) Even though 
none of these nine signs was regarded as warranting using it alone, ‘‘certain 
constellations of the nine appeared promising.’’ However, after testing these 
patterns with a new sample of improved and unimproved patients, the au- 
thor ruefully concludes: ‘‘There are no prognostic signs common to the 
Rorschach records of both psychotics and nonpsychotics which have the 
predictive accuracy claimed for the Rorschach signs reported in earlier stud- 
ies.” 

In one study, the predictive validity of several Rorschach variables 
and of certain other test scores shows up in a much more favorable light. 
Stotsky (108), using two samples of matched groups of remitting and non- 
remitting schizophrenic hospitalized patients, made a series of predictions 
regarding their differential psychological test performance. Practically all 
of his predictions were confirmed with respect to the direction of the dif- 
ferences of scores for the two groups, several of them at a respectable level 
of significance in both samples. In general, the test records of the remitting 
groups reflected impairment in intellectual and emotional function, tending 
to resemble more those of neurotics than hospitalized schizophrenics. In 
spite of these significant differences in the test variables, some of which ap- 
pear large enough to permit useful prediction of remission in schizophrenics, 
it should be noted that the nonremitting group had a history of more previous 
shock therapy. Also of interest is Stotsky’s finding that although certain 
Rorschach variables showed significant differences for these two groups of 
patients, the total Rorschach protocols could not be sorted better than chance 
by six Veterans Administration trainees. 


Sales performance.—Bills & Taylor (13) report that over achievement in a 
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sales school, i.e., greater than that predicted from an intelligence test, is sig- 
nificantly related to three different criteria of later sales success. Wallace & 
Twichell (115) show that the early (first few months) performance in life 
insurance selling is such an accurate predictor of later success that they urge 
its use for the advantage of both the company and the salesmen. 

Spencer & Worthington (106) describe in part an elaborate procedure for 
projectively evaluating the Worthington Personal History form and report 
a validity of .34 against tenure and .31 against amount of sales for a group of 
house to house salesmen. 

The highest validities of the year are those reported by Tobolski & Kerr 
(113) who report that the Empathy Test scores of 23 auto salesmen corre- 
lated .71 with rankings by the sales managers and .44 with an objective 
index of sales success (cars sold divided by attempts). Interesting is the fact 
that scores on this same test were found by Van Zelst (114) to show similarly 
high validities for several criteria of performance of labor union officials. 

Achievement in college and professional schools.—Nearly a score of papers, 
of which that by Melville & Frederiksen (87) is typical, were concerned with 
the prediction of success in academic or professional school programs. In 
general, high-school standing continues to be the best single predictor of 
college grades, followed closely by a score on some test of scholastic aptitude. 
Several of the studies show that for the prediction of performance in pro- 
fessional schools, some measure of interests (motivation?) will improve the 
multiple R obtained. 

Turning to the Menninger project, in which the criterion is over-all com- 
petence in psychiatry, we have already noted the extremely low validities 
found for the selection interview. Although it will be some time before the 
full report is available, Holt & Luborsky (67) also report additional samples 
of the predictive validities being obtained for other techniques. Judgments 
based on the TAT, which had earlier been reported as yielding very promis- 
ing validities, are now reported as —.03, .00, —.02, and .10 depending on the 
judge and the criterion. For the Rorschach alone, the comparable values are 
.26, .22, .02, and .05. That the criterion contains predictable variance is indi- 
cated by predictive validities of .30 and .50 for two judges using all available 
data (Credentials, W-B, a Word Association test, Rorschach, TAT, Self- 
Interpretation, Picture Reaction Test, and Interview). Although this project 
relied very little on objective tests, it is of interest to note that validities of 
—.29 and +.24 were found for two Strong Vocational Interest Scores, Pro- 
duction Manager and Lawyer. 

Performance in social work training is the criterion used by Frankle in a 
research program for which as yet but preliminary reports are available. 
Frankle has been particularly interested in predicting the field work per- 
formance of social work students as distinguished from academic grades. In 
a symposium at the 1951 American Psychological Association meetings, he 
had reported that this criterion was predictable with very respectable validi- 
ties in two independent samples, by two very different predictor variables. 
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The first was scores on the Michigan Speed of Reading Test, which showed 
correlations of approximately .50, even though this particular criterion was 
not predictable by intelligence tests. The second promising predictor was a 
set of scores derived from the Minnesota T-S-E Inventory. Although none of 
these scores yielded by this inventory showed useful validities, a ‘‘persevera- 
tion score’’ (based on the number of times the subject gives the same answer 
to successive pairs of items) yielded validities of —.63 and —.44 for the two 
samples. 

Frankle’s ingenuity in locating still other predictors of this same criterion 
is shown in a further report (40) dealing with objective indices derived from 
the Harrower Group Rorschach. Whereas H and Hd scores looked promising 
in one small sample, these did not hold up in cross-validation samples. 
Analyses of M+ scores showed more promise in all groups, but still yielded 
only low validity. However, another objective score (the M+ latency index) 
derived by weighting M+ responses by the position in the sequence of re- 
sponses to each card, looks still more promising. In fact an empirically de- 
rived “‘pattern’’ utilizing this index as the key variable yields validities of 
.42 to .72 for different groups and different criteria. That the human move- 
ment Rorschach variable, objectively scored and empirically validated, may 
have considerable general validity is suggested by the fact that M+% was 
reported by Kelly & Fiske (71) to correlate significantly with most of their 
criterion measures for clinical psychologists even though none of the global 
predictions based on the Rorschach was significant. 

Teacher performance-—Gough & Pemberton (52) found no significant 
relationships between grades in practice teaching and conventional scores 
derived from the MMPI? or group Rorschach; but empirically derived sign 
approaches to the MMPI yielded a P value of .05 in cross-validation. Callis 
(19) reports that scores on the Minnesota Teacher Attitude Inventory cor- 
relate .49 with pupils’ evaluation of the teacher using a 47 item inventory. 
Two judges evaluated the same teachers and although their ratings corre- 
lated only .33, their combined rating was predicted by the Inventory scores 
with an 7 of .40. When the ratings by principals were used as the criterion, 
the validity of the inventory scores was only .19. 

Other criteria.—Martin (85) reports an r of .37 between the score on a 
simple number checking test (same-different) and the accuracy with which 
300 checks were listed on an adding machine after 10 to 12 hours of practice; 
the reliability of the criteria being .71. Seashore (101) reports significant va- 
lidities for the prediction of clerical performance in banks but calls attention 
to relevant criterion problems encountered in validational studies in this 
field. For International Business Machine operators, Gadel & Kriedt (43) 
report fair predictions of performance by aptitude tests. Job satisfaction of 
the same subjects was not predicted by any of the Kuder scores, but the 
authors report the development of a special International Business Machine 
Interest Test which held up well on cross-validation. 

The quality of student drawings was found by Crannell (25) to correlate 
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.59 with scores in the Meier Seashore Art Appreciation Test when the cri- 
terion was the pooled judgment of many fellow students. When the judg- 
ment of three artists was used as a criterion, the validity falls to .37 for the 
same subjects. Reliability of the two criteria were .90 and .80 respectively. 

Gough (51) reports the development of a scale for predicting Social Par- 
ticipation and validities of .35 and .55 using as a criterion the number of 
activities in four schools participated in by high school students. 

Hathaway & Monachesi (62) report on the follow up of a large scale study 
of 4,000 ninth grade children administered the MMPI in 1947-48. During 
the next four years, nearly one sixth of the group had one or more encounters 
with the police. This group included nearly three times as many boys as girls. 
Both sex groups showed significantly high elevations on the F, Pd, and Ma 
Scales of the MMPI. The delinquent girls also showed high Hy, Pa and Sc 
scores. Gough & Peterson (53) describe the development and validation of a 
promising inventory designed to assess predisposition to crime and delin- 
quency. Clark (23) reports additional validation data based on his AWOL 
Recividist Scales for a small group of MMPI items. 

Driscoll (29) was concerned, not with the assessment of delinquency, but 
with predicting the institutional adjustment of prison inmates. Prisoners 
who adjusted were, in the judgment of officials and guards, significantly 
lower on the D, F, and Pa Scales of the MMPI. A modified TAT failed to 
differentiate the good and poor adjustment groups. 


TEstT ORIENTED STUDIES 


A large proportion of the published studies relating to assessment pro- 
cedures continue to be oriented around single tests rather than focused on (a) 
the prediction of a specific criterion or (b) the identification and measure- 
ment of a particular variable. Instead of selecting or devising tools and 
techniques most appropriate and effective to the task, all too many authors 
seem to be motivated by an attitude “here is our instrument. Let’s see how 
many different variables we can infer from the behavior sampled by it or 
how many group differences we can find with it.” 

Whatever the reasons, the majority of the published papers continue to 
be most readily classifiable by tests or techniques. Even using such a classi- 
fication, it is not possible to undertake a meaningful summary of these papers 
in a brief review. In fact, the output is so staggering as to make one wonder 
if even specialists with the technique are able to read it. As an example the 
Fourth Mental Measurements Yearbook lists 618 references to the Rorschach 
alone as appearing between 1947 and 1951 inclusive! During the same period 
of time, 1947-51, 197 additional references were noted for the TAT, and for 
the more recently introduced Szondi 67 references had already been noted by 
the end of 1951. 

Several new texts in clinical psychology were published during the year; 
all include discussions of the more popular tests used in diagnosis. The most 
comprehensive treatment of this group of techniques will be found in Weid- 
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er’s Contributions toward Medical Psychology (116). In Volume II of this 
work, devoted entirely to psychodiagnostic methods, some 30 specialists 
describe the techniques with which they are most closely identified. 

Rorschach.—Although the writer does not claim to have digested all of 
the more than 100 recent Rorschach references, they appear to reflect a 
trend toward a somewhat more critical evaluation of this technique than 
characterized studies of previous years. In only a few of the studies did the 
author rely on “clinical interpretation” as the main source of data. As usual, 
several papers were concerned with Rorschach signs as related to diagnostic 
categories. Although most authors are able to report a few ‘‘significant’’ 
differences, most studies report a very large amount of overlapping of groups 
known to be widely different. Furthermore, there seems to be relatively 
little consistency of the findings of different investigators. Friedman (41) for 
example, compared the Rorschachs of hospitalized schizophrenics and 
normals, and although he found statistically significant differences for eight 
variables, these were so generally small as to lead him to this final conclu- 
sion: 


It would seem, then, that consideration should be given to the redefinition of the 
customary Rorschach scores or the construction of new ones. In order to be most 
fruitful new variables should reflect vectors of a systematic and comprehensive theory 
of personality rather than isolated traits or independently considered aspects of 
psychological functioning. 


Nearly a score of the authors contributing to the Rorschach literature 
during the past year were concerned only with factors affecting the Rorschach 
performance treated as an end in itself. Thus, Siipola & Taylor (105) report 
that ‘‘pressured instructions” result in more “symptomatic’”’ responses; 
Gibby (48) reports that the examiner as well as the cards influence the 
the obtained protocol, especially that part of it obtained in the inquiry, and 
Klatskin (73) found that the testing situation (hospital versus factory) 
affects the Rorschach record. Maradie (84) employing a series of randomized 
sequential card orders found that position in the sequence is positively 
correlated with number of responses but that some cards elicited more re- 
sponses than others regardless of position. Allen et a/. (1) on the other hand, 
reports no relation between sequence of presentation and responses to the 
cards. Two or three additional papers this year report no relationship be- 
tween color and responses. 

Fiske & Baughman (37) in a detailed analysis of the relationships be- 
tween Rorschach scoring categories and the total number of responses, 
found many significant relationships, several of them nonlinear. In their opin- 
ion, the computation of per cents by dividing by R is only a partial solution 
to the problem; they would prefer to see the development of new and 
better measures of the promising Rorschach variables. 

Just what Rorschach variables are most promising and what they really 
are would still seem a moot question, especially in view of an exploratory 
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study reported by Lotsof (81), who factor analyzed the intercorrelations 
among 10 variables: a test of verbal intelligence, three measures of verbal 
fluency derived from recordings of oral descriptions, and seven Rorschach 
variables believed to be relatively unaffected by verbal fluency. Although 
based on relatively few cases, four fairly clear-cut factors emerged. The 
loadings of the Rorschach variables strongly suggests the possible need for 
a revision of scoring categories and conventional interpretations of the Ror- 
schach determinants. 

In spite of these and other studies which raise certain doubts regarding 
the validity of conventional Rorschach interpretations, the year also saw 
the publication of several new texts. These include a third volume by Beck 
(12) and from the Gesell Institute of Child Development a book entitled 
Child Rorschach Responses (2). The subtitle here is ‘‘Developmental Trends 
from Two to Ten Years.’ This is essentially a book of age norms based on 
50 children, 25 of each sex representing half year levels from 2 to 6 and year 
levels from 7 to 10. A subsequent publication is promised to cover years 10 
to 16. 

While the present reviewer does not pretend to the kind of expertness 
needed to evaluate these texts, he does feel qualified to at least question 
the impact of the last mentioned one. It can be predicted with reasonable 
confidence that this “book of norms” will lead to marked increase in the 
use of the Rorschach with children. How unfortunate, then, it would seem 
that the authors chose to base their ‘“‘norms’’ on subjects anything but 
representative of the population with respect to intelligence and social 
status. Actually over three-fourths of the children fall in the professional 
and managerial group and a like proportion are in “above the average 
categories” in intelligence. In view of the known relationships between 
intelligence and R and between R and many other scoring categories, such 
“norms” seem likely to be most misleading. Of equal concern is the authors’ 
completely uncritical acceptance of the conventional interpretations of 
determinants even when applied to the records of children. 

Szondi.—During the past year, the Szondi seems to have replaced the 
TAT as the second most frequently studied projective device. Most of the 
Szondi articles deal with specific hypotheses regarding the stimulus value of 
the cards; only a few are in the nature of validational studies. The simplicity 
with which this test can be administered, the objectivity of the scores, and 
conflicting theories of its proponents will probably lead to many more 
papers. For anyone planning research with this instrument, the excellent 
Szondi review and critical evaluation by Borstelmann & Klopfer (16) is a 
must. This article contains not only reviews of previous work on the Szondi 
test but includes a thoroughly sophisticated discussion of the theoretical 
and methodological issues of personality testing and a plea for rigor in 
research on the validation of projective devices. The authors regard the evi- 
dence thus far available as not supporting the theories of either Szondi or 
Deri and suggest the need for systematic research which will permit, first, 
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the reclassification of the cards in terms of their empirically determined 
stimulus values, and second, the validation of differential patterns of re- 
sponses to the cards. In other words, pictures of human faces would seem 
to offer considerable promise as alternatives to verbal symbols as items for 
a “personality inventory” but validation of the resulting scores would fol- 
low by relating test performance to external criteria. These authors conclude 
their review as follows: ‘‘Any clinical instrument is not a shibboleth to be 
defended or refuted, but a provocative tool which requires careful investiga- 
tion as to the nature of its contribution to the understanding of human 
behavior.” 

Publication of such articles, together with other evidence of increasing 
methodological sophistication on the part of persons doing research with 
projective tests should result in markedly improving the quality of future 
research in this domain. 

Drawing techniques.—On the basis of the number of papers seen, a close 
competitor of the Szondi are the various techniques utilizing drawings of 
one sort or another. Among these, the most popular seems to be the Draw- 
a-Person technique, and some of the investigators use highly objective 
procedures for testing hypotheses. For example, Weider & Noller (117) 
find that 61 per cent of younger as compared with 48 per cent of older 
children use the upper-left quandrant of the paper. Does this mean that 
adults who do so are more regressed? 

The next most popular drawing test seems to be the House-Tree-Person. 
An example of the ridiculous inferences which may result from a priori 
interpretation of drawings without norms is contained in a publication by 
Hammer (58) who asked three clinicians to evaluate the over-all adjustment 
of large samples of white and Negro children using only the H-T-P drawings. 
Although the reported interjudge reliabilities are very high, the median 
rating assigned Negro children was ‘‘severe neurotic’’ or ‘‘borderline psy- 
chotic” at all grade levels. White children fared somewhat better but still 
the median rating for the junior high school age groups was ‘‘neurotic.”’ 

TAT.—Relatively few new TAT papers appeared during the year. One 
showed that TAT stories told can be matched with greater than chance 
accuracy even by untrained judges. Two studies (54, 118) showed that modi- 
fication of the TAT pictures by making the central figure more similar 
to the subject do not elicit greater productivity. Fry published a new 
scoring blank and manual (42). Shipley & Veroff (104) report the develop- 
ment of a measure of Need for Affiliation, and Clark (24) reports an attempt 
to use the TAT to measure Sexual Motivation. 

Bender-Gestalt—Turning to the Bender-Gestalt references, one reports 
a significant difference (but still large overlap) between schizophrenics 
and normals in Pascal-Suttell scores, another no relationship between 
recall of B-G figures and Shipley Hartford Scores, another analyzes the 
41 Billingslea variables in B-G into six correlated clusters. Suttell & Pascal 
(110) report the use of the B-G in a study of regression in schizophrenia. 
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They conclude that regression as measured “is not peculiar to the schizo- 
phrenic.” 

Other techniques.—Studies of individual intelligence tests are still numer- 
ous. A number of papers agree that the Wechsler Intelligence Scale for 
Children is a pretty good intelligence test. Although we seem to be getting 
out of the period when the Wechsler-Bellevue is being used to evaluate 
almost everything but intelligence, there are still a few papers on scatter 
and a few authors still looking for ‘‘signs’’ of reading difficulty, delinquency, 
and deterioration, etc. Hunt & French (69) summarize their developmental 
work on the CVS scale and offer it not only as a good short test of intelligence 
but as a coarse diagnostic screen. Bluett (14) reports normative data fora 
battery of tests found useful in the rehabilitation of handicapped adults. 

Using the Rosenzweig Picture Frustration technique, Holzberg & Hahn 
(68) compared adolescent psychopaths and normals, finding no differences 
in hostility and guilt. Another study fails to confirm previously reported 
findings of differences in stutterers and nonstutterers. In another paper, 5 
out of 18 variables were found to differentiate anxiety neurotics and normals 
at the .05 level of significance. 

As contrasted with the single technique approach reflected in so many 
papers, Lehmann & Dorken (77) report the use of a battery of projective 
tests. Although composed of four short tests, all of which can be scored 
relatively objectively, this battery shows promising discrimination among 
diagnostic categories. 

As usual, several papers proposed new projective techniques, which to 
their creators, at least, ‘‘appear promising.’’ To this reviewer, Kinget’s 
“Drawing Completion Test’’ (72) seemed the most promising, perhaps be- 
cause the author presented considerable data based on the technique. 

Among the paper-and-pencil inventories, the MMPI was the most fre- 
quently studied instrument, followed closely by the Strong Vocational 
Interest Blank. Both instruments continue to show promise of utility for 
certain assessment situations, but in some instances the predictive validity 
is so low as to suggest the desirability of searching for other predictors 
instead of relying entirely on a favorite one. Typical papers are those by 
Williams (119) reporting the development of a ‘‘Caudality Scale’’ for the 
MMPI and that by Kriedt, Stone & Paterson (76) on the Vocational In- 
terests of Industrial Relations Personnel. Both Navran (88) and Healy & 
Borg (63) report no validity of interest scores for the prediction of success 
in nurses’ training. 


METHODOLOGICAL CONTRIBUTIONS 


The failure of self-report inventories to show predictive validities in 
some selection situations has led to a number of efforts to identify and correct 
for “faking” on the part of the subject. A paper by Schlesser (99) suggests 
a number of possible solutions to the problem. Another approach lies in the 
development and use of more “subtle” items as suggested by Seeman (102) 








306 KELLY 


and as followed by Gough (50) who in the development of a new M-F scale, 
eliminated items for which judges could recognize the characteristic M or F 
response. 

Still another approach to the problem of ‘‘faking”’ on self-report inven- 
tories is the development of ‘‘objective tests’’ measuring the same variables 
as traditionally approached by the Yes-No type of item. The development 
of such tests calls for ingenuity, but that it is possible is shown by the 
success of Cattell & Horowitz (21) in the construction of a series of short 
objective tests of “‘altruism’’ most of which turn out to be highly saturated 
with two or three personality factors previously identified through ratings 
and inventories. 

Much of the reluctance of many psychologists to use objective measures 
of personality and statistical combination of such measures appears to stem 
from a conviction that the most accurate representation of an individual's 
personality and the most accurate prediction of his future behavior requires 
the use of mathematical models other than the ‘‘additive’’ one embodied 
in the widely used multiple regression technique. That an objective and 
statistical approach need not be limited to this or other specific models is 
reflected in the increasing use of procedures for quantifying the congruence 
of profiles, summarized by Cronbach & Gleser (27), and in the development 
of promising techniques of configural analysis of both profiles and item 
responses as suggested by Sullivan & Welsh (109) and by Gaier & Lee 
(46). A still more dramatic demonstration of the possibility of objectifying 
and statistically analyzing variables which the clinician regards as relevant 
is the P-technique factor analysis of the day-to-day variation in motives in 
a single subject reported by Cattell & Cross (20). 


CONCLUSION 


In this review of the theory and techniques of assessment the writer 
has attempted to report the present state of affairs as objectively as his 
obvious biases would permit. In doing so, it has been necessary to admit 
an almost complete absence of relevant theory and a discouragingly large 
number of studies yielding negative results. In spite of extensive efforts in 
the field, as reflected in an almost fabulous number of publications, an 
unfortunately large proportion of the researches reported do not seem to 
make any real contribution to theory or to practice. Furthermore, all evi- 
dence points to the fact that much of current practice involves the use of 
tests and techniques for which there is almost no evidence of predictive 
validity for the relevant criterion. 

In spite of this dismal over-all picture, the reviewer is not discouraged 
regarding the future possibilities of assessment. Already, there is evidence 
that it is possible to predict a number of very different socially relevant 
criteria with a sufficient degree of accuracy to be useful. While the propor- 
tion of variance in the criterion accounted for may still not be large, there is 
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reason to believe that psychologists have the necessary creativity, ingenuity, 
and skills to develop still better measures of both predictor and criterion 
variables. Even more important, assessment psychologists are peculiarly 
equipped with the methods and skills to carry out both the necessary evalu- 
ation of present procedures and crucial tests of new procedures suggested by 
advances in personality and measurement theory. To the extent that they 
utilize this unique competence, we may expect continued progress in the 
prediction of socially significant criteria. And to the degree that the predic- 
tion of real life behavior constitutes the acid test of the validity of hypotheti- 
cal constructs, research in assessment may serve a “reality testing’ function 
for theoretical developments in psychology. 
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By GEORGE SASLOW 


Departments of Medicine and Neuropsychiatry and Division of Psychosomatic 
Medicine, Washington University School of Medicine, Saint Louis, Missourt 


Professor Wrenn (author of the chapter on Counseling) and I have made 
no attempt to delimit counseling and psychotherapy by means of a defini- 
tion. We are inclined to consider such an attempt as little likely to succeed 
as the recent search for a definition of psychoanalysis [Committee on Evalu- 
ation of Psychoanalytic Therapy (1)]. We shall therefore leave psycho- 
therapy ‘‘as an undefined technique which is applied to unspecified problems 
with nonpredictable outcome” [Lehner (2)]. This chapter will concern itself 
with the therapeutic process in medical settings, and with conceptual, 
clinical, and research activities in the field of therapy which seem highly 
relevant to such settings. The available materials for the period from May, 
1952, to May, 1953 will be dealt with under the headings individual psycho- 
therapy as process, individual psychotherapy in relation to outcome of 
therapy, multiple psychotherapy, group therapy, and general comments. 
Emphasis will be more upon critique than upon bibliography. 


INDIVIDUAL PSYCHOTHERAPY: As PROCESS 


The possibility of predicting which patients will continue in psychother- 
apy after the ninth interview has been studied by means of Rorschach scores 
[Auld & Eron (3)]. A proposed prediction formula, combining the total 
number of responses R, the percentage of these that are large usual details 
D%, and the number of form-color responses minus the number of color-form 
responses FC—CF, had been derived from 98 patients receiving outpatient 
group psychotherapy and subsequently applied successfully to another 
sample of 52 patients in individual therapy in the same clinic [Kotkov & 
Meadow (4)]. The developed formula was tried with a new sample of 33 
psychiatric clinic patients in another city. It failed to predict which patients 
would continue individual psychotherapy and which would not. Auld & 
Eron therefore emphasize that, until cross validation and validity-generali- 
zation studies have been done, no confidence can be placed in the general 
applicability of any method of diagnosis or any formula for predicting be- 
havior. 

An improvement in the assessment of certain changes in individuals 
during psychotherapy is suggested by Butler (5). Procedures used so far 
by the Rogers group involve the construction of categories of recorded 
verbal behavior to be studied through time, counting the frequencies of 
behavior within the categories, and expressing these as percentages. Butler 


1 The survey of the literature to which this review pertains was completed in 
May, 1953. 
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points out that the categories set up generally neglect the context of be- 
haviors outside them and that such neglect leads easily to unjustified in- 
ferences from the numerical data. He therefore proposes a ‘‘context coeffi- 
cient’’ which does take account of the context of behaviors outside the cate- 
gories being explored, and reduces to the narrower type of coefficient when 
noncategory responses are zero. 

Raimy’s method of studying self-references during psychotherapy has 
been extended by Bugental (6). One recorded case, the “Herbert Bryan” 
case in Rogers’ 1942 book, was examined as an example of the new procedure. 
The thought units are divided into six self, not-self categories, called the 
Conceptual Matrix. The feeling tone of each unit is evaluated in terms of 
four categories (positive, negative, etc.). Interrater reliability was adequate. 
Consistency of the rank order of the categories in the Conceptual Matrix, 
from interview to interview, was high. The sample analyzed indicates the 
possibility of a fuller description of the patient’s verbal participation in the 
therapeutic process in objective and quantified terms. The material is 
permanently available for the study of hypotheses, for making comparisons 
with other patients, examining the patient’s achievement of defined goals of 
therapy, etc. 

Various methodological approaches to the study of therapy as a process, 
with indications either that work is in progress or that unexpected difficulties 
have been encountered, are described in the recent book edited by Mowrer 
(7). Nearly all the material referred to in the book has already appeared in 
print, but the unusually high level of organization of it in various chapters 
is provocative and rewarding. Review of studies (since 1943) of the process 
of client centered therapy leads to the inference that they have been mainly 
studies in conceptualization; they have helped build ways of looking at ther- 
apy which have added to the theory-building process. They are not self- 
contained validity studies of therapy, but represent stages in the acquisition 
of knowledge which will need subsequently to be verified through independ- 
ent measures of the phenomena examined [Seeman & Raskin (8)]. The con- 
siderable amount of work which has been done since 1947 with the discom- 
fort-relief quotient (DRQ) is assessed by Dollard & Mowrer (9) and by 
)}Mowrer, Hunt & Kogan (10). The DRQ appears to be a reliable and valid 
indicator of other objective measures of personal change during therapy and 
‘of therapeutic success as estimated by psychotherapists. But it is not valid 

Yfor social casework, the procedure from which it was derived, for reasons 
not now evident. Nor does it appear to be correlated with “tension” in 
schizophrenic subjects, again for reasons not evident. A description, histori- 
cal outline, and critique of the procedures known as “Q technique”’ are 
‘given by Mowrer (11). Of major import is his demonstration that contem- 
porary research in the field of personality is moving rapidly toward the 
development of a valid statistics of the individual case. Subdivisions of the 
developments in the general field are presented by Cronbach (12) in ‘‘Corre- 
lations between persons as a research tool,’’ Luborsky (13) in “Intraindi- 
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vidual repetitive measurements (P technique) in understanding psycho- 
therapeutic change,” and McQuitty (14) in “‘A statistical method for study- 
ing personality integration.’’ These four chapters in Mowrer’s book, which 
do not permit easy summary, suggest that methods of study of individuals 
in clinical situations may soon be more clearly related to traditional research 
methods that use a statistics of groups. New ways of looking upon verbal 
behavior in psychotherapy are discussed by Mowrer (15). He considers them 
to have possibilities in the study of the process. Among them are the relative 
frequency of different parts of speech (pp. 503-21, 526-30); percentage of 
references to others and to self (pp. 522-24); content analysis through 
time to show the rise and fall of expressed concern with various problem 
areas (pp. 524-25); changes in the meanings of key words during therapy 
(pp. 530-36); study of consistency and contradiction during therapy (pp. 
536-44), etc. All of the procedures have interesting aspects, and none has 
been supported by adequate systematic study. The beginnings of such 
study are described in a chapter dealing with tension, discomfort-relief 
quotient, and palmar sweat changes during psychotherapy, with special 
reference to resistance [Mowrer et al. (16)]. Transcribed interview recordings 
of 20 subjects are being used (the study is as yet unpublished). ‘‘Tension”’ 
was reported by each subject on a five-point scale, just before and after 
each interview; ‘‘happiness,” similarly on a seven-point scale. Persons who 
completed therapy (n, 15) were distinguishably and appropriately different 
with regard to these simple measures from the five who did not. In 10 
cases, subjective ratings and objective perspiration measurements were 
available; these showed good congruence, and thus both methods seem to 
have high face validity. Striking discrepancies occurred, which proved 
illuminating on detailed examination of the interview material: in one such 
case, the palmar perspiration measure appeared congruent with the inter- 
view material and the self-ratings were not. In the five recorded cases, the 
correlation between discomfort-relief quotient and palmar perspiration was 
.54; the changes were thus in the expected direction. In a chapter canvassing 
the possible inclusion of physiological changes during therapy as a necessary 
part of study of the process, attention is called to the so-far relatively unex- 
ploited use of skeletal muscle action potentials [O’Kelly (17)]. 

An experimental study of two definably different psychotherapeutic 
techniques has been published by Grossman (18). Two groups of 10 males 
each were matched for interest in obtaining therapy, temperament test 
scores, verbal and reasoning ability, etc. Each subject had three 60-min. 
interviews, the last being recorded. The therapist’s statements were classi- 
fied by himself and another judge; the therapist rated each subject’s insight 
and progress on a five-point scale after the third interview, a day or two later, 
and four weeks later. Two hypotheses were examined: (a) subject’s insight 
is greater following therapist responses that recognize explicitly expressed 
feelings or attitudes; (b) subject’s insight is greater following therapist re- 
sponses that recognize feelings or attitudes inferred from subject’s verbal 
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or motor behavior (‘interpretations’). For some unaccountable reason, the 
desired matching was not fully achieved in the subjects’ interview behavior: 
seven of the 10 in one group did not behave as if they had a definite prob- 
lem. Despite this unforeseen difficulty, the data warranted accepting hypothe- 
sis (b) and rejecting (a). It seemed clear that the independent variable 
(therapist behavior) could be manipulated in a predetermined manner, that 
objective criteria could be used, and that several dependent variables could 
be held constant. Similar observations, apparently originally part of a doc- 
toral dissertation, have been published by Dittman (19). Several judges 
evaluated 965 recorded verbal responses of one subject in 30 successive inter- 
views with one therapist. Therapeutic movement (as defined in terms of 
progressive self-exploration) was found to be associated with (a) high level 
of participation by the therapist (in the sense of completeness of under- 
standing of what the subject is communicating), (b) either response to feeling 
or to behavior of subject with therapist, or to both, provided the response 
occurs in the context of consistent response to both feeling and interpersonal 
behavior, and (c) therapist responses which were “‘slightly deeper’’ than pure 
reflection. 

In a study bearing upon the above topic, the nature of psychotherapeutic 
interpretation was examined by Porter (20). He utilized transcribed client- 
centered interview material. He found that the interpretations by therapists 
of patients’ verbal and gestural productions could be grouped into five 
classes. He suggested that characteristic patterns in the use of favored 
classes would be found in therapists of different but known training frames 
of reference, but that so long as an interpretation relates accurately to the 
attitudinal state being expressed by the patient, therapy will take place. 
He thought it possible that each class of interpretation influences differently 
the speed and the pathways of conceptual and affective reorganization. 

| The three papers just described provide a framework for the remark of 
Rogers (21) that he has “reluctantly come to concede the possibility that 
the process, directions, and end points of therapy may differ in different 
1 erage orientations.”” Butler (22), in a detailed consideration of the 
jinteraction of client and therapist, compared client-centered therapy with 
classical psychoanalytic therapy from the point of view that therapists of 
} these two orientations may well set in motion quite different therapeutic 
\ processes. He focussed his comparison upon the hypothesis that “in two- 
way personal-social interaction, each individual concerned displays antici- 
patory behavior which in turn elicits anticipatory behavior on the part 
of the other, the anticipatory behavior of the other in each case confirming 
the prior learnings of the individual about himself.’’ These prior learnings 
have been learned early and slowly and are quite fixed. Resistance and 
transference are, in terms of Butler’s hypothesis, joint functions of the social 
learning of the client and of the stimuli presented by the therapist to the 
client. When, at a particular point in psychoanalytic therapy, systematic 
interpretation is undertaken, the patient perceives the stimuli produced by 
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the therapist as resembling those connected with his maladjustment, in 
that interpretation is similar to parental behavior. Since the therapist be- 
lieves the transference and resistance behavior to originate in the patient 
alone, he presents certain stimuli to him rather than others, and then the 
typical psychoanalytic phenomenon transpires: an interpersonal struggle 
which is the transference relationship. The sequence of behavior in client- 
centered therapy appears quite different: incipient transference and resist- 
ance reactions often appear early, but die out presumably because the ther- 
apist’s behavior continues to be directed toward understanding the frame 
of reference of the client as expressed verbally and nonverbally. Butler 
believes the particular behavior of client-centered therapists serves to deny 
and extinguish anticipations of the client with regard to the therapist. But- 
ler’s considerations make clear why each type of therapist is likely to be 
convinced of the inevitable course of therapy done his way, for the course 
of therapy is largely determined by patient responses evoked by the thera- 
pist. He remarks also that it is at present by no means certain that the same 
learnings must occur to produce a satisfactory therapeutic outcome: only 
the end product may be decisive for success. Grossman’s and Dittman’s 
results must be interpreted in the light of this remark. 

The clear definition of therapeutic orientations, as shown above, has 
obviously been productive of new knowledge about therapy as a process. 
But it seems to be quite difficult, at the level of practice, to keep the thera-| 
peutic orientations firm and easily separable. Thus client-centered therapists, | 
like other therapists, are faced with involuntary clients [Beier (23)]. A willing- | 
ness to deal with such clients implies expansion of the views of client-centered | 
therapy. Beier includes children, delinquents, prisoners, employees, and | 
persons referred by courts among involuntary clients, and gives a few brief | 
examples of three pretherapeutic procedures designed to assist such clients to | 
become motivated clients. It should be noted that Beier’s definition of an 
involuntary client, as one who engages in maladaptive behavior and who has 
some wish to free himself from such behavior but who does not seek the 
assistance of a therapist, is easily extended to encompass the common 
problem of a patient in a medical setting whose behavior is in one way or 
another relevant to his illness. Often such a patient has a strong wish to free 
himself from disabling medical symptoms, which are associated with 
maladaptive behavior, but he does not recognize the association, does not 
necessarily wish to alter the behavior, and does not seek a therapist for 
alteration of his behavior. Yet experience shows over and over that his wish 
to free himself from medical symptoms, combined with medical study (which 
excludes or gives proper place to alternative explanations) and prethera- 
peutic procedures which include but go beyond those described by Beier, 
are often successful in leading to long-term alterations of behavior through 
psychotherapy. Thus a determined refusal to expand the client-centered 
orientation probably implies refusal to benefit a great many clients by at- 
tributing in advance the alleged low probability of success entirely to the 
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client. Observations like those of Beier have also been made by Moustakas 
& Makowsky (24). They found a strictly nondirective approach to fail 
with a number of parents who came for help for their children. Slight changes 
from orthodoxy were effective in increasing their clients’ motivation (hence 
one should again point out the relevance of Butler’s hypothesis that the 
resistance of clients may be produced by the therapist’s technique). 

Starting with a different type of client, the college student, Perry & 
Estes (25) also reject the strictly nondirective approach. They present a 
carefully reasoned argument for abandoning the notion that directiveness 
and nondirectiveness are processes in a unidimensional continuum, in favor 
of the concept that they belong to different realms. They are regarded as 
complex processes with many possibilities of conflict and of coordination. 
It is postulated that ‘“‘any counselor-participation which is assimilated by 
the client to a set in which he perceives himself as ultimately responsible 
for initiative and evaluation is properly describable as client-centered.”’ 
Once the client has assumed this set, the counselor may put at his service 
(through activities conventionally excluded from the nondirective reper- 
toire) the knowledge and skills of that part of the culture in which the 
counselor is a specialist. 

The difficulty of holding to sharply drawn lines is brought home by 
cccasional instances that refuse to stay inside them. Thus, the advisability 
of undertaking psychotherapy against the will of the patient is discussed by 
Odenwald (26). A single case is described in which a delinquent boy ordered 
to have psychotherapy while on probation became, after probation was 
over, a well-motivated patient. An extraordinary case is described by Ring 
(27). A 51 year old man, severely ill with paranoid schizophrenic psychosis, 
for 30 years, had been considered completely hopeless. After a bimedial 
leucotomy, he had nine months of carefully planned treatment, including 
psychotherapy up to two hours a day, five days a week. He received much 
planned emotional support from his relatives and various hospital per- 
sonnel who were instructed to practice systematic nonresponse to his de- 
lusional behavior, and interested discussion of his lucid behavior. During 
therapy the patient’s ideation, behavior disturbances, etc., resembled 
markedly those of the Schreber case discussed by Freud. He was discharged 
after nine months and remained well for two years, at which time Ring’s 
report was written. Here leucotomy appears to have made the patient 
accessible to behavior therapy, which was highly flexible and not dependent 
upon either nondirectiveness or insight alone. Of interest also is a single 
case described by Sifneos (28). A young woman with anorexia nervosa, a 
disorder generally considered to be related significantly to early social 
learnings, had remained ill for seven years. Unsuccessful treatment had 
included psychoanalysis, electroshock, and insulin. Neurosurgical inter- 
vention, in the form of a unilateral lower quadrant right frontal leucotomy, 
was followed by much improvement. 

Unusual instances such as the three cited not only raise questions of 
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great general interest with regard to the dependence of new learning upon 
age, upon severance of thalamocortical pathways, etc., with regard to the 
extinguishing of old learnings in relation to severance of thalamocortical 
pathways, etc., but also challenge the framers of psychotherapeutic orienta- 
tions to fit them in. 

If we turn our attention to another therapeutic orientation, the psycho- 
analytic, from the point of view of therapy as process, we may begin with 
some conclusions of Cameron (29). In a discussion of the ‘“‘activity of the 
therapist in integrative forms of nondirective psychotherapy,” he portrays 
him as an active participant, contrary to much now believed and practiced. 
He considers free association (strictly defined in practice and expectations) 
a scientific tool congruent with nineteenth century scientific premises, but 
not with those of the twentieth century, and as having serious limits to its 
effectiveness. He describes the therapist as a social invention capable of 
many different kinds of actions and of a great number of actions, in addition 
to sitting, hearing, and interpreting. He mentions in outline four major 
categories of action a therapist may show: (a) resolving capacity, i.e., ability 
to identify patterns in the streaming events of psychotherapy, an ability 
of which we have little knowledge; (b) power, the ability to effect the 
reorganization of the patient’s handicapping behavioral patterns, about the 
working of which we know little; (c) acting as an agent of change, by the 
way in which he facilitates and maintains communication with the patient 
and supplies models of more effective behavior patterns; and (d) functioning 
as a working model, with whom a patient can safely try out hitherto unsuc- 
cessful relationships. These observations of a nonpsychoanalyst may be 
compared with some comments upon individual psychoanalytic and group 
analytic therapy by a Freudian psychoanalyst [Foulkes (30)]. He empha- 
sizes strongly how often individual psychoanalysis reveals only half the 
story about a patient, and often the less important half at that. He is 
impressed by how much that was not clear or suspected is learned by 
observing the patient in a group situation. He writes with conviction, men- 
tions the opportunities for research that his groups provide, but presents 
no data. A Kleinian psychoanalyst points out that the psychoanalyti 
method has largely ceased to be a historical method involving a reconstruc 
tion of the patient’s past and has largely become a method for investigating 
the influence of (characteristically unconscious) situations and relationships| 
in inner reality upon contemporary experience and behavior [Fairbairn] 
(31)]. The method may therefore now be described as ahistorical, not genetic.| 
The patient, it is assumed, is therefore helped most by dealing with his| 
“here and now” behavior in either a dyadic or a group setting. A Kleinian’ 
analyst interested in a strictly psychoanalytic approach to group therapy 
remarks that patients show greater changes by devoting one or two hours 
of treatment per week to such group therapy than to individual psycho- 
analytic sessions [Sutherland (32)]. In a symposium on group therapy, 
another psychoanalyst discusses various important difficulties of group 
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therapy, the saving of time for the therapist rather than for the patient, 
and the fact that patients behave differently in individual and in group 
sessions [Klein (33)]. 

Thus a number of psychoanalysts have found it necessary to expand 
their therapeutic orientation, and appear to be approaching Cameron’s 
emphasis upon the necessity, during therapy, for the patient to interact 
with one or more persons in a variety of ways, if the therapist is to under- 
stand enough about him, and the patient is to change significantly in a 
reasonable time. 

Since apparently clearly defined therapeutic orientations are so difficult 
to sustain in their initial forms, the interesting question arises whether 
factors common to the various orientations exist which may be more rele- 
vant to the therapeutic process experienced by the patient than the philoso- 
phy or technique of the orientation. As is well known, the question has been 
studied ingeniously by Fiedler and his co-workers. Their work is sum- 
marized by Fiedler (34). The conclusions that the essential aspects of a 
therapeutic relationship are a more elemental component in therapy than 
methods or techniques and that good therapeutic relationships are primarily 
a function of expertness and are independent of school make one skeptical 
of the virtues claimed for new techniques, especially when unsupported by 
some kind of evidence beyond assertion. The conclusions also direct atten- 
tion to the need for more explicit definition of ‘‘expertness,”’ ‘‘good thera- 
peutic relationship,” etc. 

When therapy has taken place, presumably the patient has learned 


‘something new during the good therapeutic relationship he has experienced, 


and what he has learned has generalized to interactions with other persons. 


| Thus therapy may be considered to deal with the notion of behavior dis- 


~ 


order as interpersonal incompetence, and to be a form of social learning 
|Shoben (35)]. Shoben presents a formulation of neurosis that stresses its 
social origins within the family and its social consequences as they affect 
later interpersonal behavior, drawing from laboratory studies of conditioning 
and instrumental learning the principles by which these experiences produce 
neurotic patterns of behavior. He is able to do this without postulating 
discontinuity between human data and those from other biological sources, 
and without the necessity of falling back upon such difficult-to-define con- 
cepts as id, ego, superego, and psychic energy. Utilization of the same 
sources and principles permits him to give a description of psychotherapy 
as a resocialization experience, within the framework of the two-factor 
learning theory, and (again) without the need of the difficult-to-define con- 
cepts above mentioned. An attempt to describe in more detail the events in 
a psychotherapeutic sequence as they may be pictured in terms of the princi- 
ples utilized by Shoben and by Dollard & Miller (36) is given by Hofstaetter 
(37). His hypothetical model for the psychotherapeutic process leans chiefly 
upon avoidance and approach generalization-gradients involving tabooed 
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acts, neutral acts, pertinent verbalizations, and neutral verbalizations, and 
appears to be congruent with the known psychotherapeutic orientations. 
Mowrer (38) presents clearly the major emendations of Freud’s concep- 
tions of neurosis and psychotherapy with which he has been concerned for 
some years. He finds it congenial to utilize those difficult-to-define concepts 
which Shoben has demonstrated can well be dropped, cites telling general 
considerations in favor of his view that repression is directed against the 
superego rather than the id, but submits only small samples of the kind of 
evidence that could be crucial in deciding between his views of the direction 
of repression (and important related aspects of neurosis and psychotherapy) 
and Freud’s. He indicates that a book with sufficient evidence on these mat- 
ters is in preparation, and judgment will have to wait upon its appearance. 
There are significant differences between Shoben’s and Mowrer’s ways of 
looking upon neurosis and psychotherapy in the light of the same learning 
theory, for appreciation of which the references themselves must be studied. 
Very few attempts have been made to design an experiment in which 
present knowledge of learning is utilized to fashion deliberately a technique 
of psychotherapy, in contrast to the effort which has gone into applying the 
conceptual framework of learning theory to psychotherapy as a process 
[Shoben (35), Mowrer (38)]. One such experiment is available in mimeo- 
graphed form from the Veterans Administration Hospital at North Little 
Rock [Peters & Jenkins (39)]. In an earlier publication, the hypothesis was 
stated that the schizophrenic process is a replacement of adaptive behavior 
by stereotyped “frustration behavior’; that such a process sets in when an 
individual’s frustration tolerance is passed; and that when the resulting 
maladaptive stereotyped behavior results in more frustration, the process 
is typically progressive [Jenkins (40)]. The hypothesis suggested a possible 
method of treatment consisting of the reestablishment of flexible adaptive 
behavior by a process of retraining, utilizing simple techniques developed 
in learning studies with animals. The hypothesis rests upon the experimental 
work of various students of learning and frustration, particularly Maier, 
who has also shown that the stereotyped behavior is reversible by a process 
of guidance to ‘correct solutions” of previously frustrating situations (41). 
The study by Peters & Jenkins is an effort to evaluate the effectiveness of 
such “guidance” in patients with chronic schizophrenic psychoses not show- 
ing a favorable response to other methods of treatment. A small number of 
patients, 12, was chosen for each experimental run. They were selected from 
one closed-ward building, met certain predetermined criteria, were matched 
with regard to a behavior rating scale and placed randomly into three groups: 
group I to receive the special learning treatment; II, to receive treatment 
identical with that of group I, except for the absence of the learning treat- 
ment; and III, to receive the routine treatment given all patients on their 
ward. Groups I and II received subshock insulin injections every morning, 
then attended occupational therapy, where in a few minutes group III 
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joined them. Every week-day morning, groups I and II received intense 
occupational therapy, working with one or two nurses, two aides, two student 
nurses, and occasionally an occupational therapist. During the two-hour 
occupational therapy period, patients in pairs (one from each group) were 
taken to another room for breakfast; the order of eating breakfast early or 
late was rotated. The patients in group II, on reaching the breakfast room, 
received breakfast at once; group I patients went to the experimental room, 
where they spent 20 to 30 minutes at problem solving with fudge as a re- 
ward. The problem solving was done individually with one patient and a 
psychologist in the room. After his test run, the patient had breakfast and 
returned to occupational therapy. The group III patients were not known 
as such to the building personnel. 

The experiment was continued for three months. In this experiment, 
insulin-induced hunger was used to motivate, on a biological level, learning 
by human beings. After 13 months, the insulin was dropped for groups I 
and II; it no longer seemed necessary for motivation of their problem solv- 
ing behavior. 

Solution and learning of problems was the critical variable. It began 
with a simple obstruction problem: getting a piece of fudge covered by a 
transparent plastic tunnel 18 in. X2 in. X2¥% in., with three 6 in. sticks that 
were available at one end of the tunnel. Once a patient had perfected a 
method of solving this problem (2 to 4 days), he went on to a more difficult 
one, and so on. Modified stylus mazes, circular pencil mazes, multiple 
choice boxes, reasoning problems and proverbs were used. Fudge was used 
as a reward for all these types of problems. The last type of problem was an 
interpersonal relation or social type of problem, chiefly simple psychodrama 
to stated themes. 

The patient thus moved from simple to complex problems, from motor 
to ideational problems, from individual to social problems, and from prob- 
lems in which there was a direct reward (his movements automatically got 
him the fudge) to those in which the reward was indirect (fudge handed him 
by the psychologist for correct solutions to circular maze and to reasoning 
problems, no fudge as reward in the psychodrama situation, but praise from 
the group leader for the least of efforts). 

No patient completed all of the problems without some help. This guid- 
ance was given in a planned, consistent manner, even to the psychologist’s 
moving a stylus correctly through the maze pathways while the patient was 
required to hold the end of the stylus. The procedure for each patient was 
planned carefully from day to day. 

Four different kinds of measurements were used to evaluate the effective- 
ness of treatment, three were made during the six months after the experi- 
mental period. The measurements were objective ward incidents (transfer 
to a better ward, home on trial visit, destroying property, etc.), the behavior 
rating scale, occupational therapy ratings, and number of days worked in 
‘special occupational therapy.” 
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At the time of the mimeographing of the material here summarized, three 
sets of 12 patients each had been observed for six months after completion 
of the experimental period. The hypotheses investigated would lead to the 
expectation of evidence of greater improvement in group I than in II or III 
and in group II than in III. The observed group differences for groups I and 
II were favorable to group I for all four measurements, and the three possible 
differences for groups I and III were favorable to group I. The results ob- 
tained with the objective ward incidents are statistically reliable between 
group I and the other two groups, and suffice for a rejection of the null 
hypothesis that there are no real differences between the groups. The results 
as to differences between groups II and III were inconclusive. Thus the re- 
sults indicate that guided problem solving, motivated by hunger, is an effect- 
ive method of improving the social adjustment of chronic schizophrenics. 
The effects of the treatment seem great enough to be measurable in small 
samples. The procedure appears to warrant much further study. 

I shall conclude this section of the review with notice of various unrelated 
contributions. Some important points are made about the importance of cul- 
tural understanding to psychotherapy in a description of three child guidance 
cases [Chess et al. (42)]. These came to a clinic which dealt with patients from 
varied national and ethnic backgrounds. The material is of anecdotal type. 
It indicates that diagnosis, prognosis, understanding of what is happening in 
therapy, and of the therapist’s proper behavior may all be seriously in error 
if there is insufficient awareness of cultural differences between therapist and 
patient. Though so slight in substance, this paper is one of the very few exist- 
ing applications of the cultural approach to personality in clinical psychiatry. 
A broad consideration of various social factors in therapy, with some illus- 
trative survey and case material concerning therapists and patients, is to be 
found in Ruesch’s contribution to a meeting of the Association for Research 
in Nervous and Mental Disease (43). A number of the points made are gen- 
erally neglected, yet are of considerable importance to the more effective 
practice of psychotherapy in situations of significant differences between 
therapist and patient as to social class, cultural background, status, etc. 
Seven dramatic examples (anecdotal) of the hypnotherapeutic unmasking, 
intensification, and recognition of an emotion are given by Rosen (44). One 
of the therapeutic techniques made possible by the hypnotic interpersonal 
relationship is the encouragement to show some real feeling. Under such 
circumstances, a number of patients who blot out conscious awareness of 
emotions may experience intensely sex desire, rage, fear, etc. These emotional 
responses and their contexts of situation and symptoms may be worked 
with therapeutically. The seven patients so treated have been followed for 
1 to 2% years. The considerable dangers of such a technique (self-deception 
of the therapist, suicide of the patient) are pointed out. Practical necessity in 
an inpatient child psychiatric setting led to the awareness that separate 
therapists need not be assigned to the several members of a child’s family 
[Szurek (45)]. One therapist could be effective in individual therapy with 
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several adults. As this was realized, it seemed also to become clear that the 
previous procedure, customary and taken for granted as standard, implied 
that several therapists, each working with one member of a child’s family, 
could work effectively with each other. This, it was now realized, could not be 
assumed to be so, but depended upon practice and experience of the several 
therapists in collaborating in just this way. Thus a therapist’s experience in 
psychotherapy of individuals does not necessarily become, by transfer, an 
acceptable level of skill in interpersonal dealings with other therapist- 
colleagues: such skill needs to be learned. The material is anecdotal. A book 
is available in which specialized techniques in psychotherapy are described 
[Bychowski & Despert (46)]. The material is impressionistic, with no eviden- 
tial support other than summarized clinical experience. The techniques 
range from narcotherapy to the use of the telepathy hypothesis. A new book 
on the roots of psychotherapy, has been published by Whitaker & Malone 
(47). They describe therapy in terms labelled ‘‘experiential,”’ as essentially 
intrapsychic, nongenetic, ahistorical, atemporal, therapist-activated, and 
dealing mainly with id processes until the patient’s fantasy needs are ful- 
filled. The patient’s growth processes having been thus catalyzed, preferably 
as intensively and quickly as possible, therapy is terminated. There are 
many provocative observations and ideas in the book, and many uncon- 
ventional ways of looking at therapy. Therapy is viewed as a symbolic fan- 
tasy relationship, and the therapist may use aggression, physical contact, or 
going to sleep, as techniques. The ‘‘brief depth therapy”’ is insufficiently ex- 
plicitly described, and no systematic data are shown that deal with the proc- 
ess or its results. 


INDIVIDUAL PSYCHOTHERAPY: AS OUTCOME 


Because of the unsatisfactoriness of our criteria for the outcome of psy- 
chotherapy, the present state of this subject is one of extreme confusion. We 
have to deal, on the one hand, with the enthusiastic conviction of psycho- 
therapists and of many of their patients that the time, effort, and money 
expended have been associated with long-lasting fruitful changes in the life 
pattern of the patient; and, on the other hand, with the contradictory or dis- 
illusioning results of the efforts to study outcome in a systematic manner. 
Eysenck (48) has for some time pointed out that his survey of the available 
reports of improvement of neurotic patients after and without psychotherapy 
fails to support the hypothesis that ‘‘psychotherapy facilitates recovery from 
nervous disorder.”’ The criteria of improvement used in the studied reports 
are discharge from a hospital, employment, and the like. In an unpublished 
study (by other investigators) of 100 patients with neurotic and psychotic 
types of behavior disorder, who did not have any planned psychotherapy, 
and who were re-evaluated from 14 to 6 years later, it was found that criteria 
such as job stability, marital stability, geographic mobility, and amount of 
health care did not discriminate the minority of the patients who showed 
significant improvement in adjustive techniques ‘and symptoms from the 
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majority who did not show such improvement [Peters & Saslow (49)]. Ob- 
servations of this kind, if confirmed, would make necessary the search for 
rather different classes of criteria of outcome, such, for example, as those 
suggested by Seeman & Raskin (50): degree to which irrelevant cues are dis- 
regarded or are distracting, speed with which hypotheses are rejected when 
the evidence for them becomes negative, number of different hypotheses 
which are formed, etc. 

A careful study in which clear-cut criteria of the outcome of therapy were 
defined and, in addition, would be generally considered highly relevant to the 
kinds of changes looked for when therapy had been successful, is reported 
by Teuber & Powers (51). This was a social experiment, in which for the 
eight years 1937 to 1945, 325 underprivileged boys in the Cambridge-Somer- 
ville (Massachusetts) area had guidance, counseling, and therapy, in the hope 
of preventing delinquency. Provision was made for the following: existence 
of an untreated control group, with 325 members matched individually, at 
the outset, with the members of the treatment group; systematic attempts 
at an objective description of the therapeutic relationships; and quantitative 
measurement of the effects of therapy. Matching variables included age, 
intelligence quotient, school grade, delinquency rating, ethnic background, | 
etc. Ten counselors whose treatment activities were not of one therapeutic | 
orientation, but ranged from a big-brother relationship to psychiatric case 
work therapeutic interviews, participated originally. With the passage of 
time, a psychoanalytically trained supervisor was added to the staff. All) 
treatment consisted of individual, face-to-face contacts. After eight years, | 
treatment had lasted for two to eight years in individual cases, with a me- 
dian duration of four years two months. 

From 1937 to 1948, the total number of court appearances was 264, as a 
result of offenses committed by 96 boys in the treated group, and 218, 
committed by 92 boys in the control group. From 1938 to 1946, 49 boys in 
the treated group appeared one time before The Crime Prevention Bureau, 
and 65 boys two or three times; corresponding figures are 49 and 52 boysin 
the control group. Unless further trends change the present picture (and they 
might), the direct comparison between the two groups fails to sustain the 
major hypothesis that treatment would reduce the incidence of adjudged 
delinquency. 

The counselors believed that the treatment program had substantially 
benefited about two-thirds of the treated boys; more than half the boys 
volunteered (after the treatment period was over) that they had been 
helped. Careful examination of the records of the treatment process (not! 
verbatim records but daily logs) showed that a counselor-boy relationship 
which was poor was characterized by mutual misinterpretation persistently 
from the beginning [thus being illustrative of the considerations of client- 
therapist interaction put forward by Butler (22)]. It also appeared that cer- 
tain counselors developed such relationships more frequently than others 
[thus illustrating the conclusions reached later by Fiedler (34)]. 
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} Before the evaluative results were known, all but two members of the 
treatment staff predicted significant success with the treated group. Hence 
an evaluation in the style characteristic of the vast majority of articles on 
outcome of therapy ever published, based on the therapist’s own story and 
lacking objective indices of observed behavior, would certainly have resulted 
in another report of very successful therapy. To some of the counselors, the 
ideas of control group and of objective indices, etc., were meaningless; they 
felt the relationships established were valuable in themselves, even with the 
negative results of the delinquency prevention program. Others believed that 
all the necessary rules for conducting therapy were known and that research 
was superfluous. When informed of the final results, the psychoanalytic mem- 
bers of the treatment staff who had this idea asserted that the results would 
have been positive had there been consistent application of psychoanalytic 
principles, while the Rogerian counselors with this idea made identical asser- 
tions about systematic use of nondirective methods. The authors state 
dryly that one thing is clear from the data: the burden of proof is on anyone 
who claims specific results for a given form of therapy. 

| Appel et al. (52) discuss published reports of long term psychotherapy 
' (over three months) by different methods. Their Table I, on p. 25, shows the 
results of treatment of 4131 psychoneurotic patients in 12 carefully made sets 
of observations available from Britain, Germany, and the United States, by 
psychoanalytic therapy, face-to-face psychoanalytically-oriented therapy, 
reassurance, re-education, hypnosis, distributive analysis, etc. The data ac- 
cumulated from follow-up periods of from 1 to 20 years. The criteria of im- 
provement of different observers are hardly comparable. Methods of obtain- 
ing follow-up information vary tremendously. Therapy processes are im- 
possible to define, from the published data, in most reports. Nearly all the 
studies survey patients treated from 1910 to 1939. 

Factors such as those above mentioned and various others make the valid- 
ity of comparison between different methods of treatment very doubtful. 
Improvement seems to take place in about 67 per cent of psychoneurotic 
patients who have psychotherapy (range 55 to 76 for 11 studies, 87 for the 
twelfth). 

Their Table II, on p. 25, shows the results of treatment of 1651 schizo- 
phrenic patients in seven centers before the introduction of insulin, electro- 
shock, and brain surgery. Improvement was reported in 39 per cent (range, 
25 to 50). More recent reports from groups especially interested in intensive 
psychotherapy with schizophrenic patients show improvement in 41 per 
cent of 139 patients (Chestnut Lodge 1944 to 1949) and in 60 per cent of 37 
patients (J. Rosen). 

The authors point out that, if the available figures mean anything, they 
may mean that psychotherapy has no effect whatever on patients, or that 
(if the natural history of the behavior disorders were known) nonspecific and 
common elements of the different types of therapy would be found to out- 
weigh grossly in effectiveness their dissimilarities. On the grounds that it is 
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contrary to experience, the authors reject the first possibility and accept the 
second. 

That the common elements in different types of psychotherapy are of 
major relevance to the changes which take place in patients is a hypothesis 
congruent with the observations of Fiedler and with the impressions of vari- 
ous able psychotherapists; also, it is attractive and plausible in its own right. 
It has been clearly demonstrated, however, in far simpler situations, that 
reasoning of this kind does not necessarily answer questions of causal rela- 
tionship in any specific manner [Kliiver (53)]. He cites a number of observa- 
tions from different fields which are examples of an “equivalence of stimuli’; 
in that heterogeneous stimuli, such as diverse chemical substances, disease 
processes, or sensory stimuli, often produce the same series of effects in an 
organism. If one attempts to account for such an equivalence of stimuli by 
specifying the set of properties or the common factor involved in producing 
the same effect, one may find no lack of general theories, or vague hypotheses 
purporting to account for particular forms of equivalence, or (as in the case 
of chemical agents) abandon all hope of explaining the results on a chemical 
basis, or that it is possible only to specify such common factors as are not 
relevant to the effects to be explained, or that the common factors must be 
located in the processes of the affected organism. 

A few observations of the outcome of psychotherapy are available in the 
field of the psychosomatic or ‘‘stress” disorders. Stine & Ivy (54) made a 
preliminary report dealing with the effect of psychoanalysis on the course of 
peptic ulcer. Although they circularized the entire membership (368) of the 
American Psychoanalytic Association for help in the study, their material to 
1951 is described as completely inadequate to shed light on the question, ‘‘How 
does psychoanalysis alter the course of peptic ulcer?” It was found possible to 
collect 17 cases, of which 7 proved suitable for a follow-up study. Two were 
unimproved, and five were improved. The paper discusses the difficulties 
which beset the authors, who propose to continue the study. A point of inter- 
est is that 80 per cent of the 253 psychoanalysts who replied to inquiries by 
the authors had not psychoanalyzed a single case of ulcer. A preliminary re- 
port of an appraisal of the results of treatment of various stress disorders is 
given by Berle et al. (55). The results of treatment of 739 patients in a clinic 
emphasizing psychotherapy, with follow-up of one to two years after dis- 
charge, are summarized. The physicians had been trained chiefly in internal 
medicine and were supervised by a senior psychiatrist. Therapeutic tech- 
niques varied from highly directive to nondirective, from suppressive to vig- 
orously exploratory activity. Free association, dream analysis, correlation of 
health state with events and attitudes, and encouragement to free expression 
of feelings were used. Each patient was appraised by his own physician and 
(by review of the latter’s appraisal) by the authors in terms of 852 items de- 
scriptive of him. Some of the disorders of the treated patients were irritable 
colon, asthma, peptic ulcer syndrome, essential hypertension, urticaria, 
vasomotor rhinitis, ulcerative colitis, and diabetes. A third of the patients 








326 SASLOW 


failed to show improvement, two-fifths improved symptomatically, and a 
quarter showed improvement considered to be basic in that it appeared to in- 
clude a reorientation or maturation of the personality. Since such data can- 
not answer the question whether the results achieved might have occurred 
spontaneously, other studies were planned in which matched treated and 
control groups would be compared. Thus 34 patients with ulcerative colitis 
treated in the special (psychosomatic) clinic were carefully matched with 34 
other patients with ulcerative colitis treated in various other clinics of the 
New York Hospital (where the main emphasis of therapy had been placed on 
diet, medication, and antibiotic agents). For each patient treated in the 
special clinic, another was sought who matched him in age, sex, duration and 
severity of illness as judged by x-ray and other criteria, etc. Compari- 
son showed consistent advantages in the group receiving psychotherapy, 
with regard to deaths (psychotherapy group 9 per cent and matched group, 
18 per cent), operations (3 per cent versus 29 per cent), worse (0 versus 6 per 
cent), same (23 versus 21 per cent), improved (39 versus 18 per cent), 
symptom free (26 versus 15 per cent). The samples are small, but the differ- 
ences are thought to be significant. Subsequent material dealing with other 
stress disorders in which the same matching procedure was followed has been 
made available by Grace (56). For 23 untreated patients with peptic ulcer 
matched with 23 ulcer patients who had psychotherapy, the outcome was 
better in the group having psychotherapy (number of operations, number 
worse, same, improved). For 27 patients with asthma matched with 27 with 
asthma who had psychotherapy, the outcome was better in the patients hav- 
ing psychotherapy (number worse, same, improved). Similar data are being 
collected for publication by Grace on all the stress disorders studied in the 
psychosomatic clinic of the New York Hospital. 

Zubin (57) copes with the problem of delineating a design for the evalua- 
tion of various types of therapy. Basic criteria for such an evaluation are met 
by less than a dozen published long-term follow-up studies. In these studies, 
nearly all the immediate outcomes indicate a definite advantage for the 
specific treatment (whether psychotherapy, psychoanalysis, or other) but no 
advantage for the long-run comparison. At present, no one is able to explain 
such observations. 

Hunt (58) argues that the question of the outcome of psychotherapy can- 
not be neglected, as a matter of professional responsibility, and cannot be 
solved by studies limited to changes in the patient during the process. Nor, 
if change in the patient is in fact associated with psychotherapy can we omit 
to ask if the relationship is causal, which question requires determination of 
the frequency with which the same kinds of change occur without psycho- 
therapy. He proposes a large-scale design, complex in execution, that would 
provide three kinds of data: (a) about the initial state of the patient (b) 
about the patient and therapist during the psychotherapeutic process, and 
(c) about changes in the patient’s subjective state, behavior, and effects on 
others. 
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Interest in the therapy of one patient with several therapists has been 
shown in several quarters. The material published is anecdotal, but, from a 
number of points of view, illuminating. Dreikurs et al. (59), in a second paper 
on the subject, describe briefly four patients who had simultaneous multiple 
therapy. The authors believe the process to have certain obvious advantages 
for the patient: (a) he can be more detached, (b) he can interact with two per- 
sons, (c) he can see a therapist make mistakes, (d) he can see an example of 
a sustained working relationship (between the two therapists) despite dis- 
agreements, (e) he can observe one therapist act out patient’s role in difficult | 
situations and learn more quickly his own cues and responses in that role, and, 
(f) his dependency problems are more easily dealt with. Multiple therapy | 
with juvenile delinquents is described by Buck & Grygier (60). Anecdotal 
material is presented about seven boys and two girls. The patient worked 
with a male and a female therapist, simultaneously, successively, or alter- 
nately, or with one therapist while the other therapist worked with mother. 
Home visits were made by either one of the therapists. The authors consider 
the advantages of these procedures to be: (a) the therapists have more in- 
sight into the patient, (b) the therapists have more awareness of the patient’s 
total situation, material and human, (c) there is direct dealing with the total 
sphere of the patient’s disturbed relationships, and (d) there is better han- 
dling of transference behavior in all the cases. So impressive has the procedure \ 
of multiple therapists seemed to the staff at Chestnut Lodge, that it appears 
to have become a recognized part of the treatment program |Dryud & Rioch / 
(61)]. They use the term multiple therapy to mean treatment by interview of | 
one patient by more than one therapist at the same time. They describet 
therapy of seven patients, each of whom had been having intensive individual 
psychotherapy. All but one continued with individual therapy, after multiple 
therapy was stopped. In six, there were less than seven multiple therapy inter- 
views. Thus multiple therapy was used as a type of brief therapeutic inter- 
vention within the framework of long-term individual therapy. The material 
presented makes the following remarks pertinent: (a) the goals, methods, and 
results seem as varied as in individual therapy, (b) the procedure seems to 
have much possibility for dramatic resolution of an impasse in long-term 
intensive individual therapy, and (c) it apparently has this possibility be- 
cause new observers see quickly and easily so much that psychoanalyzed 
therapists do not see about the patient’s and especially about their own be- 
havior. In addition, the individual and the multiple therapists seem subse- 
quently to be better able to deal with each other as colleagues, are less com- 
petitive, less defensive, etc. The improvements in the therapist’s perceptions 
and behavior occur in so little time that we must consider the multiple 
therapy situation one of high learning potentiality, yet it simply amounts to 
observation of and participation in a labelled overt interpersonal situation 
a few times. The perceptual distortions, rationalizations, ineffective behav- 
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iors, etc. of the individual psychoanalyzed therapist in an impasse with his 
patient constitute observations confirmatory of those published from Chest- 
nut Lodge previously by Stanton & Schwartz (62): i.e., adequate ability to 
see oneself objectively as a therapist and to perceive one’s patients and col- 
leagues objectively is evident after personal psychoanalysis and formal psy- 
choanalytic institute training with a frequency which leaves much to be de- 
sired. Yet objective appraisals of the necessary kinds are relatively easily 
made if others can observe the therapist in action, and, moreover, can easily 
be communicated to said therapist, who, still more surprisingly (in view of 
his and our preconceptions) quickly puts them to good use in the form of new 
responses to old cues involving his patients and his professional colleagues. 
‘As might be expected, Dryud & Rioch found therapists initially reluctant to 
| face the multiple therapy situations, though later appreciating their use- 
\ fulness. The implications of these observations for the training of therapists 
and for therapy are not discussed in the paper. Yet the difficulties in per- 
ceiving, evaluating, and utilizing a patient’s and a colleague’s verbal, non- 
verbal, and social behavior are by no means limited to the therapists at 
Chestnut Lodge, who are trained in the manner advocated by H. S. Sullivan. 
‘Very similar difficulties are indicated among therapists observed at Yale, 
where the training is closer to Freudian psychoanalytic procedure [Caudill 
| et al. (63)], and just as little therapeutic use appears to be made of the inter- 
personal relations among their patients and between themselves and their 
{colleagues by therapists observed at Washington University [Henry (64)]. 
\The therapists at Washington University are trained by a wide variety of 
methods (none like that of Sullivan): some follow the Freudian procedure, 
om follow that of the Chicago Psychoanalytic Institute, some are eclectic 
‘in their practice. The implications of observations of such similar difficulties 
of therapists in centers utilizing training procedures thought to be so dissim- 
ilar are, therefore, well worth a glance. Perhaps all present methods of 
training psychiatrists as therapists, long and expensive though they are, 
omit learning in social situations to such a degree as to fall far short of 
achieving certain definable results with even fair frequency. It is conceivable 
that therapeutic effectiveness could be learned more quickly by systematic 
application of the observations made by Dryud & Rioch. And if therapists 
could increase in effectiveness by learning so much more quickly than we now 
think they can learn, why not patients? Thus the observations of the multiple 
therapists suggest important new ways of looking at therapy, the chief 
feature of which is that we accept considerably lower expectations of the 
ability of a patient to generalize individual interview learnings to other inter- 
personal situations without practice in those situations. 


Group THERAPY 


Since Sanford’s 1953 survey (65) of the ideas about psychoanalytic group 
therapy held by psychoanalysts in Britain who are followers of M. Klein, 
was published, no research material by members of this group seems to have 
appeared. 
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A book summarizing a project in group psychotherapy which started in) 
1946 has been published by Powdermaker & Frank (66). The project was} 
undertaken within the United States Veterans Administration under the \ 
auspices of the Washington School of Psychiatry. It began in 1946 with a | 
seminar group which discussed informally the available material on the psy- 
chology of group behavior and on research methods in the field; next there 
was therapy of an ambulant (chiefly neurotic) patient group (July, 1947 to 
December, 1948) and of a hospitalized chronic schizophrenic group (July, 
1947 to May, 1949); and finally, came the analysis and publication of what 
was done and learned. The type of therapy used may be described as psycho- 
analytic in orientation, and much more like that of Foulkes (30) than of 
Sutherland (32). Most or all of the 24 psychiatrists had had, or were having, 
psychoanalytic therapy and analytic institute training. Five clinical psychol- 
ogists and two psychiatric social workers were members of the project staff. 
All group therapy sessions were conducted by psychiatrists (including psy- 
chiatric residents), and observers (social worker or clinical psychologist) 
were present at nearly all sessions. Various methods of studying the process 
of therapy, e.g., using verbatim transcripts, categorizing and rating defined 
items in the group sessions, utilizing all or a representative sample of the 
available material, were looked into and discarded. The investigators found 
it necessary to limit themselves to the study of the problems of therapy which 
were of most immediate concern to the doctors and on which they had the 
most material. They tried to isolate the specific problems so defined, and to 
devise methods for valid study of them. They came to rely primarily upon 
data obtained by trained observers, working closely with the psychothera- 
pists, and analyzed qualitatively. 

The basic data consisted of the following: running accounts by observers 
of the activity in the group; a selection of what seemed to be therapeutically 
significant; summaries of meetings prepared by observer and therapist; 
situation analyses of events which appeared to have negative or positive 
therapeutic significance; therapist’s judgment about a patient; psychologist’s 
test data; and social worker’s interview with patient, focussed on patient's 
interpersonal relations. 

The patients studied in the clinic consisted of 124 white male veterans 
(9 psychotic.) Of the patients studied in the hospital, 88 were on an experi- 
mental ward and 86 on a control ward. Very few of the therapists had any 
prior experience with group therapy, and a number who worked with schizo- 
phrenic groups had practically no experience with individual therapy of 
schizophrenics. All therapists were volunteers. Most of the patients had 
both group and individual therapy sessions. 

The difficulties of the project, such as inexperienced mobile personnel, 
administrative problems in the hospital project, thwarted desire to deal with 
the many variables of the group therapeutic process that could be explicitly 
defined and yet appeared mutually dependent, and unsolved methodological 
problems of criteria of outcome of therapy, were staggering. Despite them, 
much was learned for subsequent studies on a smaller scale. The careful pres- 
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entation of the operation of the project and of a number of situation analyses 
will be found very rewarding. Only a few points will be mentioned here. 
Careful study of the completeness and accuracy of the observer’s running 
account of a session showed by no means such a great inferiority to mechan- 
ical verbatim recording as might be suspected (Appendix B, pp. 531-48). 
The psychiatrist’s evaluation of individual patients, upon which much de- 
pended for evaluation of outcome of therapy, seemed to the reader as unsatis- 
factory here as elsewhere (Appendix C, pp. 549-52). The psychologist’s use 
of the Rorschach test in the project (Appendix E, pp. 565-79) is subject to 
the most serious criticism since it was used in a global fashion that implied, 
without evidence to prove it, confirmation in a patient’s social and group 
behavior of his overall Rorschach test performance and of specific aspects of 
that performance. Such use of the Rorschach ignores the great many pub- 
lished studies which show failure of specific determinants, patterns of Ror- 
schach signs, or global Rorschach interpretations to predict any specific or 
general behavior of a subject [see for example, Auld & Eron (3); Roe (67)}. 
Roe found it impossible to guess from the Rorschachs of eminent Ameri- 


| can scientists either their high sustained achievement level or how they 


} differed from hospitalized patients. Group therapy was more taxing and 
| complicated than individual therapy, the doctors thought; and even those 


; 
f 


with considerable experience in individual therapy showed little quick gen- 


‘ eralization to the group therapy situation, or even quick learning in it. 


The idea that personal psychoanalysis frees a therapist to behave or to 
learn more easily in interpersonal situations is weakened (a) by one doc- 
tor’s awareness that he fantasied his observer (who was being psychoana- 
lyzed by the doctor’s own analyst) reporting to his analyst what the doc- 
tor did poorly (p. 328), and (b) by another’s feeling threatened at the 
prospect of closeness between two members of the group, and especially 
anxious in the presence of social conversation (p. 325, 326). There are 
suggestions in brief hints like these to make one wonder: (a) if transfer- 
ence relationships are ever worked through; or, (b) if worked through, facili- 
tate easier interpersonal relations; or, (c) if working through them means a 
new permanent irrational tie to one’s analytic therapist; or, (d) if having 
worked through them, one now needs to spend an equal amount of time 
learning how to deal with closeness, social conversation, or group therapy 
situations. Some remarkable observations are presented that have to do with 
intense rivalry situations set up between schizophrenic patients in a group 
in relation to a definable kind and intensity of doctor-patient relationship in 
that group (pp. 392-434). Under certain conditions, two such patients and 
the doctor appear to have undergone a symbolic fantasy experience of 
very high intensity, certainly for the patients and probably for the doctor, 
with subsequent sudden strikingly favorable behavioral alterations in one 
of the rivalrous patients: the whole procedure reading like an illustration of 
the thesis of catalyzed growth through satiation of intense infantile needs in 
the book by Whitaker & Malone (47). A unique description is that of therapy 
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with a group of 87 chronic schizophrenic patients, most of whom also at- 
tended the small therapy groups (pp. 466-92); the large group functioned 
for 1% years and had advantages of its own. 

As to outcome, in the clinic group, improvement occurred in 61 per cent 
of those who had individual and group therapy and in 52 per cent of those 


who had only group therapy (pp. 38-65). A weakness in such an evaluation 


was the fact that chief weight was given to the doctor’s judgment; even | 
though this was influenced by social worker and psychologist information, | 


one should recall the study of Teuber & Powers (51) which showed how little 
relation the intense conviction of interested parties in therapy may have to 
explicitly defined outcome. In the hospitalized group, analysis of many cate- ( 
gories of comparison with the control ward group showed critical ratios sig- 
nificant at the 1 or 5 per cent level in only a few instances (pp. 344-58). 
In general, the data indicate that the experimental patients tended more 
often to react overtly in stressful situations than the controls, and that the 
group therapy had a stimulating effect upon outwardly purposeful activities 
and verbal relationships with others. 

Some provocative illustrations scattered throughout the book (66) show 
that in group therapy the presence of a new doctor may elicit transference 
reactions, new life history material, etc., which had not appeared before; 
or may enable patients for the first time to express certain attitudes toward 
the former doctor or feelings thought unacceptable to him. It will be recalled} 
that Foulkes (30) is impressed by how much is learned in a group situation | 
that was not clear or not suspected in individual therapy sessions. Taken| 
together, these observations suggest that psychotherapy might be more ef- 
fective if the patient’s therapy included either (a) interacting in a representa- 
tive sample of his life situations or (b) being encouraged to give adequate 
descriptions of such a sample of his interactions, trying out new ones, evalu- 
ating them with the therapist, again trying, etc. 

The second idea is undoubtedly used more or less systematically by 
many therapists who are not of a single therapeutic orientation. The firs 
idea leads, by a natural extension, to the plan of making the patient’s entir 
life situation therapeutic for a limited period. This has been attempted b 
Jones, and described in his book The Therapeutic Community (68). The plan 
of the Industrial Neurosis Unit (a hospital) grew out of two earlier experiences 
with therapy in groups, the first dealing with 100 patients who had neuro- 
circulatory asthenia, and the second with 1200 exprisoners of war (who rep- 
resented the most maladjusted 1 per cent of the 100,000 British repatri- 
ates from the European theatre of war). Therapy in the Industrial Neurosis; 
Unit covers the whole working day of the patient and includes all his con-; 
tacts with other persons. Intensive individual treatment is discouraged, few, 
patients had uncovering psychotherapy, and none had psychoanalyti 
therapy. The patient-sample was drawn from desocialized patients with\ 
severe character disorders (chronic unemployed neurotics) consisting of drug 
addicts, prostitutes, prison offenders, schizoid personalities, sexual perverts, 
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aggressive psychopaths, etc. One-third of 104 patients traced in follow-up 
visits were failures, 22 per cent made a “fair adjustment,’’ and 44 per cent 
held good records (health, work experience, family life) for six months after 
a duration of three to four months of treatment. All treatment was free 
through the Ministry of Health of the United Kingdom, and many of the 
patients have war pensions. Over 85 per cent of the patients admitted had a 
history of psychiatric treatment. 

It appears to have been demonstrated that it is possible to change social 
attitudes in relatively severely desocialized patients, if they are treated ina 
group by a staff which develops clear concepts of each member’s role and 
which becomes a therapeutic community. The community workshops simu- 
late normal factory conditions where the patients can practice mastering 
their tensions associated with work roles, while in the hospital. The level of 
common staff responsibility and of free communication necessary for an 
entire staff to function actively in therapy is of course very high, and cannot 
be achieved without major alterations taking place in doctor-doctor, doctor- 
nurse, and all-other-staff relationships. It is doubtful that the results 
achieved by Jones could have been achieved by conventional psychiatric 
hospitalization and individual psychotherapy (many of his patients are re- 
garded by psychotherapists as incapable of socialization). It is of interest 
also that criteria of general adjustment such as job holding may be discrim- 
inating among patients of the kind treated by Jones. 


GENERAL COMMENTS 


The experimental studies of the psychotherapeutic process appear to be 
leading to an enriched general and comparative psychology. The most satis- 
factory studies of the outcome of therapy, if we define this as modification 
of patients’ behavior through the planned behavior of professional persons, 
appear to be those of the New York Hospital group on stress disorders (55, 
56) and of the Belmont Industrial Neurosis Unit (68). Both studies suffer 
from the short duration of follow-up and from other admitted deficiencies, 
and neither utilizes the most intensive types of psychotherapy now known. 

In 1950, at the Hixon Symposium on psychoanalysis as science, Hilgard 
(69) remarked that 


anyone who tries to give an honest appraisal of psychoanalysis as a science must be 
ready to admit that the bulk of the articles in its journals cannot be defended as 
research publications at all. Having said this, I am prepared to reassert that there is 
much to be learned from these writings. The task of making a science of the observa- 
tions and relationships may, however, fall to others than the psychoanalysts them- 
selves. 


My reading of the published material on psychotherapy this year leads me 
to conclude that Hilgard’s remark is as appropriate in 1953 as it was in 1950. 
It also leads me to conclude that it applies not only to psychoanalysts, but 
rather generally to medical psychotherapists as a class. The most promising 
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and the soundest work in the field is being done by psychologists, in the 
sense of work that will ultimately be part of a general, comparative psychol- 
ogy. 

The quantity and quality of the scientific work of psychiatrists is related 
to progressive loss of curiosity with professional age, in a report to be pub- 
lished by the Committee on Psychopathology from the Group for Advance- 
ment of Psychiatry (70). Related to the postulated decline in research moti- 
vation with age are thought to be at least four obvious factors: lack of per- 
sonnel, lack of free time, lack of funds, and lack of training in research. 
The last factor, even when investigation is done, perhaps explains the large 
proportion of single case reports and simple counting procedures. It is also 
notable that the psychiatrist’s constant therapeutic activity orients him 
towards research on therapy exclusively, often of a retricted character, when 
he does research. 

The possibility that the psychoanalysts, who gave the impetus to the 
enormous enrichment of psychological understanding of the last decades, 
will save the general situation is made to seem rather remote by the picture 
of psychoanalytic research presented by Glover (71). His address before | 
the International Psycho-Analytical Congress on research methods in psy- 
choanalysis is based on his own long clinical experience and his observations | 
as Director of Research of the London Institute of Psycho-Analysis for 16| 
years. He believes there has been an increasing tendency by psychoanalysts| 
not to apply to their data such scientific controls as are available. He de- 
scribes vividly how hearsay evidence becomes attested conclusion (given an 
analyst with seniority, enthusiasm or plain dogmatism); how a student whose 
professional career depends on overcoming ‘‘resistance”’ to the satisfaction of 
his training analyst can hardly be expected to defend his own scientific in- 
tegrity against his analyst’s theories and practice, so that inherent in the 
training situation is a tendency to perpetuate error; how, no matter how 
ideal their own analysis, individual analysts tend to show at meetings and 
elsewhere their own conflicts and ‘‘favorite pathological mechanisms’; how 
these three factors lead to the stereotyped proceedings of the Psycho-An- 
alytical Association; and how peculiarly susceptible to fashion, ‘‘canalized 
no doubt through a hierarchy of transferences and countertransferences”’ 
are psychoanalytic groups. He has the impression that present-day psycho- 
analytic teaching preserves many of the disadvantages of mid-Victorian 
pedagogy and few of its advantages, and that the deficiencies of such authori- 
tarian spoon-feeding are not remedied, as is often thought, by the candidate’s 
training analysis. He believes it is time these issues were faced, and that a 
first task of psychoanalysts is to settle down to the arduous task of defining 
terms, verifying criteria, and developing reliable statistics. His portrayal of 
the association between lack of scientific productivity and the psychoanalytic 
training procedure is that of a self-reinforcing system with tremendous in- 
ternal resistance to change. 


If we relate Glover’s appraisal to the observations of Powdermaker & 
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Frank, already mentioned, and to those of the multiple therapists, it is diffi- 
cult to avoid the question, “could some other method of training psycho- 
therapists and psychiatric research workers be found?” And if the most 
thoroughly psychoanalyzed persons, the psychoanalysts themselves, appear 
to have undergone so little change in their ‘pathological mechanisms,”’ is it 
not time for careful study of the outcome of psychoanalytic therapy of 
patients? 

So far as I know, the only published material on the training of psycho- 
therapists which presents an explicit alternative to the difficulties of which 
Glover gives so clear a picture is a paper by Cameron (72) describing a pro- 
gram of postgraduate instruction in psychotherapy in operation at McGill 
University for five years. The program there makes no special point of per- 
sonal psychoanalytic therapy or of psychoanalytic institute training, but does 
make a point of providing for the low generalization of the complex skills and 
understandings needed by psychotherapists, through a wide variety of types 
of study and experience over a four-year period. 
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COUNSELING METHODS'?* 


By C. GILBERT WRENN 
University of Minnesota, Minneapolis, Minnesota 


This chapter is slightly different in content and emphasis from the cor- 
responding chapters in earlier volumes. Action taken by the Editorial Com- 
mittee simplified the title of the chapter and provided for the inclusion of 
studies in psychological measurement only as these related directly to coun- 
seling. A further restriction has been made in order to minimize the confus- 
ing overlap of the terms ‘‘counseling’”’ and ‘‘psychotherapy.”’ With the co- 
operation of Professor Saslow, author of the chapter on Psychotherapy, it 
was decided to make no attempt to distinguish between counseling and psy- 
chotherapy but, in the main, to include in this chapter only studies that have 
a nonmedical setting. Studies in medical settings, whether using the term 
psychotherapy or the term counseling, are covered in the chapter on Psy- 
chotherapy. This decision will not please every reader but the confusion in 
terminology required some such definitive action as this. Rogers (57) reports 
a study of members of the American Psychological Association who, in the 
APA Directory, express an interest in the “practice of psychotherapy.” 
Forty-four phrases indicating this interest were isolated, 16 of which involved 
some use of the rubric ‘“‘counseling.”’ Incidentally, Rogers finds an increase 
in mention of this combined function in the Directory from 1948 to 1951 of 
almost 5 per cent (from 23.5 per cent to 28.3 per cent). The proportion rises 
to 35 per cent for psychologists born at the quarter-century mark as com- 
pared with 21 per cent for those born at the beginning of this century. 


THE DEVELOPMENT OF COUNSELING AS A PROFESSION 


Counseling, as a field within psychology, is making progress toward status 
as a profession. Much of this chapter is devoted to studies of instruments, 
procedures, and methodology in research, in some of which lie the claim to 
being a science. The practice of counseling is the other side of the coin. 

The attention of psychologists in general has been drawn to the field of 
counseling by two events of the past year. The first of these is the designation 
of ‘‘Counseling Psychologist’”’ as a new position in Veterans Administration 
mental hospitals and in Veterans Administration guidance centers (two 
separate programs). The second event was the publication in June, 1952, of 


1 The survey of the literature to which this review pertains was completed on 
May 1, 1953. 

2 The following abbreviations are used in this chapter: APA (American Psycho- 
logical Association); MMPI (Minnesota Multiphasic Personality Inventory); OL 
(Occupational Level). 

3 Assistance in the review of literature for this chapter was given by Edward H. 
Seldon, Court Psychologist, Minneapolis, Minnesota. 
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two reports of the then Division of Counseling and Guidance, one on rec- 
ommended standards for the doctoral training of counseling psychologists 
(12) and one on recommended standards and procedures in the practicum 
aspects of this training (77). It would seem to the writer that these two re- 
ports, officially adopted as the statements of Division 17 of the APA?, will 
inevitably exert a strong influence in the development of this field. This may 
suggest a personal bias for he was president of Division 17 at the time and 
became personally involved in this culmination of committee work that had 
extended over several years. Even now, however, these reports are providing 
the basic structure for the evaluation of doctoral training programs by the 
American Psychological Association that was requested by the Veterans Ad- 
ministration. In this connection Long’s review (40) of Veterans Administra- 
tion Guidance Centers points out that the 29 million dollars that has been 
spent in counseling some one and one-half million veterans represents only 
two-tenths of 1 per cent of the total cost of veterans’ educational benefits. He 
points up both the restrictions of this vast program of counseling and the 
gains made. A small amount of research on counseling outcomes is cited. 

The selection and professional education of counselors is outlined by 
Wrenn (88) together with a description of a small research project in selec- 
tion. He stresses the problems involved in balancing content (science) and 
procedure (practice) in graduate training programs. Cottle (16) reviews the 
existing research on the characteristics and the selection of counselors as the 
first in a series of studies to be reported on this topic. A seven-year study of 
psychotherapists at the Menninger Clinic by Luborsky (41) has some sig- 
nificance for counselors; in this he reports on a comparison of the top and 
bottom 13 per cent in rated competence as psychotherapists. The high group, 
in contrast to the low group, were more apt to be: sensitive to others, inde- 
pendent in thinking and judgment, subdued in warmth, quiet rather than 
expressive, able to express themselves appropriately, younger, married, and 
conventionally adjusted. No levels of confidence for the significance of these 
differences is reported. 

Inservice development after attaining the doctorate is described by 
Samler (61) as more important than preservice professional training. Some 
attractive procedures in this field are suggested. Failor & Williams (23) de- 
scribe the coordinated program of four universities and colleges in Colorado 
for taking graduate training to the counselors in the schools of the state. 
This is one of the better examples of what is done in over one-half of the 
states to develop school counselor competence. 

Another phase of professional development is the attention given to 
ethical standards and practices. A code of 104 statements of ethical practice 
for counseling is drawn by Gluck et al. (27) from existing codes in medicine, 
law, psychology, and social work. An adaptation of the APA code is made 
for counselors by Wrenn (89) who analyzes ethical responsibilities under the 
headings of responsibility to the client, to the employing institution and 
society, and to the profession. 
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SYSTEMATIC FORMULATIONS AND THEORY 


Several basic statements on counseling theory have appeared during the 
past year. One of these is a formulation of the counseling process that is 
based upon learning theory. Perry & Estes (53) believe that the learning 
process in counseling involves both selective, or self-learning, and directed 
learning. The nondirection and direction in learning are not a continuum but 
occupy two distinct realms; not polar opposites, but part of a complex that 
requires coordination. 

With this learning theory as foundation, Perry & Estes propose that the 
process of counseling with most college students is one of ego-building, not 
release of infantile conflicts. There are two stages in Perry & Estes’ ‘‘inte- 
grational counseling’’: (a) the permissive or nondirective, which is used until 
the client accepts the responsibility for initiative and evaluation of his own 
problems (until he achieves ‘‘the heuristic set’’), and (b) the collaborative 
phase, in which the counselor is accepted as an aid in the process of counsel- 
ing. This latter phase requires the use of skill in helping the student to see the 
problem as external both to himself and to the counselor, as upon a screen in 
front of both of them. The authors provide a few data from interview analy- 
ses intended to indicate that the interview proceeds from nondirective to 
collaborative in accordance with their theory. 

Another basic theoretical proposal is made by Butler (14) who considers 
the behavior of the psychotherapist in terms of the following hypothesis: 
“In two-way personal-social interaction each individual concerned displays 
anticipatory behavior which in turn elicits anticipatory behavior on the 
part of the other, the anticipatory behavior of the other in each case confirm- 
ing the prior learnings of the individual about himself.’’ He develops an ex- 
tensive learning theory, involving considerable reliance upon transfer and 
social learning, and with this proceeds to indicate how the therapist’s be- 
havior becomes the stimulus for the learning of the client. The article is par- 
ticularly illuminating with regard to the resistance phenomenon. 

A theory of vocational development proposed by Super (78) takes issue 
with Ginsberg’s occupational choice theory, particularly his failure to define 
clearly the nature of the compromise process which is at the heart of the 
choice made. Super constructs a ten-point theory of vocational development 
stressing the dynamics of the long process of choice and adjustment, and the 
interplay of interests, aptitudes, self-concept, and work satisfactions. A 
monograph by Small (71), referred to later, suggests that the compromise 
process ‘‘is a function of the adequacy or strength of the ego.’’ He limits the 
vocational counseling process (a limitation which would not be acceptable to 
Super) by saying, ‘Vocational counseling, emphasizing as it does realistic 
factors, is best able to benefit the realistically oriented individual. Strength- 
ening such weak (non-realistically oriented) egos is the problem of the ther- 
apist rather than the counselor.’’ The reviewer is reminded of the distinction 
made several years ago by Butler between the adjustment and the distribu- 
tive (planning) phases of counseling. 
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Another attempt to reduce complexity to orderliness was that of Black 
(10) who analyzed the features of counselor-client relationship that are com- 
mon to all therapies. He summarizes five common factors and suggests that 
we be cautious in attributing successful counseling to any particular method 
or procedure when the nature of the relationship between counselor and 
client may be the potent factor in the situation. Hahn (32) also attempts to 
find common elements in different approaches to counseling. He believes that 
there is a trend toward considering therapy as a phase of counseling, quoting 
Bordin to the effect that counselors work primarily with the normal range 
of personal adjustment, and Pepinsky’s statement that counselors work with 
“minor, functional maladjustments.”’ It is apparent that counseling is far 
from being integrated, however, and that any man’s statement may be one 
which he wishes he might modify at a later stage in his or the profession’s 
development. 

Two expositions of client-centered therapy are made by Rogers, one (58) 
in Mowrer’s recent book (48a) and one in the Scientific American (59). In the 
first of these Rogers restates some of his convictions regarding the self- 
concept and the positive and constructive nature of basic personality tactors. 
He opens this chapter by limiting his discussion to client-centered therapy 
“since I have reluctantly come to concede the possibility that the process, 
directions, and end points of therapy may differ in different therapeutic 
orientations” and closes it in the hope that “gradually some of the specula- 
tions and opinions and clinical hunches of this paper may be put to opera- 
tional and definitive test.’’ No one can quarrel with a theory that states its 
boundaries and asks that it be tested. It is interesting to note the research 
studies Rogers selects to use in his second paper, which is for ‘‘nonpsycho- 
logical’’ consumption: those by Snyder, Sheerer, Haimowitz, and Thetford. 
Rogers concludes that the research thus far has contributed more to increased 
knowledge of the dynamics of the personality than to improvement of thera- 
peutic procedures. 

Four papers contribute to self-concept theory. Two of these by Roberts 
(56) and Bills (9) use the Index of Adjustment and Values to test the rela- 
tionship between emotionality and the discrepancy between self-concept and 
ideal self-concept. The results support the assumption of a definite relation- 
ship which is persistent over a period of time. The other two papers support 
each other in showing a relationship between degree of self acceptance and 
measures of emotional adjustment. One, by Taylor & Combs (79), used a list 
of questions which are probably true of all children yet which are damaging 
to the self-concept to admit. Using sixth grade children who had been divided 
into those who were well and those poorly adjusted, they found that the well 
adjusted group were able to accept significantly more of the damaging state- 
ments. The other, by Calvin & Holtzman (15), used college fraternity mem- 
bers to whom was given the MMPI? and a “‘self-other’”’ rating form using 
seven traits. One finding was that those who deprecated self (in comparison 
with others) tended to be more poorly adjusted generally. 
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A constructive attempt to integrate counseling operationally was made 
by Maclean (42) who drew upon learning concepts of differentiation and 
integration for his argument. Stating that differentiation (specialization, 
many approaches) has been paramount in counseling thus far, he pled for 
advancement to the state of integration and listed some effective steps to be 
taken in that direction. 


RESEARCH AND EVALUATION METHODOLOGY 


The eleven chapters on research procedures in Psychotherapy Theory and 
Research, by Mowrer and his associates (48a), defy analysis within the limited 
scope of a chapter such as this. Perhaps outstanding for counselors is the 
chapter by Seeman & Raskin (64) on research in client-centered therapy. 
They summarize between 40 and 50 published studies in this field under the 
headings of ‘‘Internal Dimensions of Therapy Research,” (research on ele- 
ments of the counseling process) and ‘Studies in Outcomes of Therapy.” 
These are rich pages, to be read carefully, 

Without listing each chapter, the other features of the research section of 
the book involve a lively 90 pages by Mowrer on the constellation of Q Tech- 
niques and by Luborsky on the P Technique. Another group of three chap- 
ters deal with the tension factor in therapy and with tension as an evaluation 
criterion. Eighty pages, again by Mowrer, analyze the verbal factor in ther- 
apy, including the semantic researches by Grummon, Osgood, and Winthrop. 
Fiedler’s chapter on the function and measurement of the therapist’s feelings 
toward the client, and McQuitty’s chapter on configural analysis are other 
examples of the research value of this volume. The volume should add much 
to the resources of anyone contemplating research in counseling. 

An ambitious coordinated effort to analyze research methods in counsel- 
ing is reported by a team from the Division of Counseling Psychologists 
consisting of Dressel (19), Shoben (68), and Pepinsky (50). The first paper 
outlines the problems of criteria, design, subtle versus obvious aspects of the 
process of counseling, contamination, and financing. Dressel believes that 
outcomes must be established before process can be studied and that these 
outcomes should be sought regardless of any value judgments regarding 
them. Shoben takes issue with this, believing that the results of counseling 
per se are not sufficient, that they must be tested against the client’s psycho- 
logical health in the client’s ‘‘extraclinical world.” The client’s psychological 
health is of necessity related to the values (acceptance) of his community, 
so that evaluation is in terms of not only the client’s extraclinical behavior but 
also of ‘“‘the way that behavior is evaluated by himself and others.” After 
summarizing five common theories of counseling research, Pepinsky makes 
a case for the development of design upon a clear statement of the psycho- 
logical rationale (bias, assumptions) that is involved. He outlines five basic 
steps essential in any counseling research: (a) research objective (theory 
based upon interaction of observations and conceptualizations), (b) testable 
hypotheses, (c) unambiguous design for testing hypotheses, (d) explicit and 
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detailed planning before executing any part of the project, and (e) discussion, 
in which restraint is used in the drawing of conclusions. Dressel advocates use 
of statistical control through covariance methods of analysis in preference 
to control groups while his team mate, Shoben, issues a strong warning 
against use of criteria residing wholly within the interview. 

Eleven basic conditions that must be satisfied before successful therapy 
can be claimed, which might well become the criteria for any study purport- 
ing to evaluate counseling, are given by Thorne (80). To be noted in particu- 
lar is the need for demonstrating that (a) the process is not self-limiting or in 
a self-curative phase and (b) the effects are more than the results of a simple 
treatment of behavior symptoms. Watson in this same issue of the Journal of 
Clinical Psychology (84) bares five shortcomings of all published research on 
the effectiveness of psychotherapy. 

In a carefully safeguarded study of the amount of verbal participation 
in a group as a criterion that might be useful in counseling evaluation, Pe- 
pinsky et al. (51) used performance on a Rorschach and sociometric ratings 
as the dependent variables. With 184 variables, six multiple R’s were 
found to be significant. These provide the basis for three hypotheses: (a) 
the talkative, socially sensitive individual tends to receive a larger proportion 
of favorable role choices in a group, (R=.87), (b) the frequently chosen, in- 
tellectually striving individual will tend to talk relatively often in the group, 
(R=.82), (c) the more mature individual who tends to perceive the unusual 
will tend to be more accepting of others in the group, (R=.70). Pepinsky and 
two associates also developed a Group Participation Scale for use as a cri- 
terion in evaluating counseling (52). This sociometric device, when adminis- 
tered in five fraternities, resulted in a distribution that was spread over al- 
most the entire scale (from 2 to 8 out of a possible 1 to 9) and that had only 
slight negative skewness. More careful studies of criteria such as these by 
Pepinsky are urgent needs in the counseling field. 

The tricky factor of client motivation, as a variable in counseling re- 
search, is well illustrated in a study by Gutherie & O’ Neill (30) and a report 
by Rothney & Mooren (60). In the first study a perhaps too-simple design 
was used in evaluating the outcomes of a program of dormitory counseling 
on study habits. Because no differences in grades were found after 10 weeks 
between counseled and noncounseled groups the author proposes that the 
missing factor is that no distinction was made between students who wanted 
counseling and those who did not. All in the experimental group received 
counseling whether they wanted it or not. Rothney & Mooren report on the 
nature of a 95 per cent response to a mail follow-up of counseling. Analysis 
shows that those who had been interviewed most and who had requested 
further interviews responded first of all. This is the type of motivation-factor 
which should be accounted for in all follow-up evaluations. The authors list 
other characteristics of those responding early, but no report on tests of sig- 
nificance accompanied these statements. 
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Although a number of studies reviewed (and not reported) showed great 
naivete, the reviewer is encouraged by some of the major studies in theory 
and in research methodology that are reported in this chapter. Careful 
theory will encourage research attacks: cautious and sophisticated research 
designs will demand basic theory for testing. The advance on the two fronts 
simultaneously seems essential. I am speaking here of such basic theory 
statements as those by Butler, Perry & Estes, and Super, and of the research 
proposals by the authors of Mowrer’s chapters and by the Division of Coun- 
seling Psychologists’ team. 


PsYCHOLOGICAL MEASUREMENT IN COUNSELING 


Interest measurement.—The most significant study in interest measure- 
ment during the past year was that by Strong & Tucker (76). This was more 
than a study of medical interests; it was written to be helpful to those who 
are considering medicine as a career and to counselors who must answer such 
questions as these: ‘‘Do I have the interests of a medical man?” “If so do I 
have the interests of a specialist or of a generalist, and if the former, what 
kind of specialist?’’ With funds for a three-year study from the Office of the 
Surgeon General, four medical specialists’ scales (Surgeon, Pathologist, 
Internist, and Psychiatrist) and a revision of the Physician Scale of the 
Strong blank were produced. 

One feature of the research was the development of a ‘“‘medical-men-in- 
general” group so that the interest patterns of the medical specialists could 
be compared with the interests of both ‘‘men-in-general”’ and ‘‘medical-men- 
in-general.’””’ The four specialty blanks differentiated the specialists from 
physicians-in-general with an average r of .79. They differentiated equally 
well among each other. Of special interest to psychologists is the consistent 
tendency in factor analysis and in other comparisons, for the psychiatrists to 
be most unique in their interests, of the four medical specialties, and furthest 
from the interests of physicians in general. In fact, in one factor analysis, 
psychiatrists had a slightly smaller loading of ‘‘medical interests’ than did 
psychologists! On a loading called ‘‘interest in people” the psychiatrist was 
highest of all the medical groups and closer to psychologists and personnel 
men than to the other medical men. This study, both for its deliberate inter- 
est to be helpful to counselors and for its clear exposition of some significant 
research approaches, is a valuable ‘‘find’’ in the literature of the year. 

Melville & Frederiksen (44) studied the relationship between interest 
patterns on the Strong blank and the first-year academic achievement of en- 
gineering freshmen and found several significant correlations between grades 
and the Strong scales. Layton (39), in a more elaborate prediction of aca- 
demic success in veterinary medicine, found that scores on the Veterinarian 
scale of the Strong blank provided one of the significant predictors. Do these 
studies suggest that a3 universities select more carefully in terms of intel- 
lectual ability (provide a more intellectually homogeneous group) measures 
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of interests as motivation indices become increasingly important in predic- 
tions? [See also the studies by Schofield (63) and Morgan (48), reported un- 
der Prediction.] 

Three doctoral studies under the direction of Super were reported in a 
four-author monograph as a coordinated attempt to test the validity of the 
assumption that the Occupational Level (OL) score of the Strong test is a 
measure of motivation or drive (2). Barnett’s study of several hundred 
vagrants yielded slight but inconclusive evidence that OL? score measured 
drive or ambition. Another study by Stewart compared OL score with social 
background and plans. The score was found to be related to immediate but 
not to long range plans, and to have but little consistent relationship to social 
background. Handelsman used high school students’ statement of aspira- 
tions, two aspiration tests, and teachers’ rating of student aspiration. Only 
the latter showed any relationship to OL score and that was not of any mag- 
nitude. Super concludes, ‘‘the evidence now available warrants interpreting 
the Occupational Level scale asa measure of status of interests. The evidence 
does not warrant interpreting the OL score as a measure of drive.” 

Among a number of other studies concerned with interest measurements, 
Dressel & Matteson (20) studied the relationship between interests and ex- 
perience of college students. The analysis showed that: (a) in general the 
expressed interest was conditioned by the amount of experience; (b) there 
was a greater variability of expressed interest than of experience (not sur- 
prising perhaps for young men); (c) that interest was greater than experience 
in the science and music areas, but that the reverse was true in the clerical 
and persuasive areas. This is a suggestive study but one cannot help but agree 
with the authors that a forced-choice blank is not the most effective method 
of recording vocational experience. 

Steinberg (72) used the Kuder Preference Record to study differences 
between two groups of disabled veterans classified as ‘‘neurotic’’ and ‘‘non- 
neurotic” (the latter, probably not a pure nonneurotic). He concluded that 
“A Kuder profile characterized by low interest in Mechanical and Scientific 
areas and a relatively high interest in the Literary and Musical areas, in the 
absence of other evidence to support the validity of these interests, may indi- 
cate maladjustment.” 

Strong (74) analyzes several bases for the differences found between the 
interests of several Negro occupational groups and corresponding groups 
of white men that were reported in Hartshorne’s 1948 study. He agrees that 
there is a real difference between the two samples in that Negroes seem to 
like more items than whites but believes that a Negro men-in-general group 
must be developed before any substantial answers can be found. 

Personality measures ——If the reviewer were to accept at face value the 
gloomy review of personality inventories given by Ellis (22), there would 
be no need to write further in this section. Ellis reviewed ‘‘all researches on 
personality inventories’ published from 1946 through 1951 and concludes 
that there is little good in any of them, because they are not measuring inde- 
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pendent traits, they do not agree with each other or with projective measures, 
they are easily faked, they usually do not give significant group discrimina- 
tions. Ellis has greater faith in ‘‘interviewing and the use of projective tech- 
niques that seem to be more clinically incisive and valuable.” 

Another and somewhat more helpful review of research is that by Cottle 
(17) which condenses the findings of 215 research studies using the MMPI 
that have appeared up to 1951; it also lists 38 such studies that were pub- 
lished in 1951. There is an excellent discussion of interpretation, including 
Welsh’s two ratios, and a good description of the meaning of each scale with 
“‘normal’’ populations. A 1952 study by Blanton & Landsman (11) bears on 
the criticisms of Ellis since these two authors used the Group Rorschach and 
the MMPI on the same population which was retested after three months. 
They conclude that these two tests have different functions as far as college 
populations are concerned and that neither can be considered an effective 
substitute for the other. Another MMPI study, by Gough & Pemberton 
(29), developed a list of 15 clinical signs for interpreting the results of this 
test in a prediction setting wherein single scale analysis had proved futile. 
The signs were not too effective in a cross-validation check but ‘‘the results 
highlight the error involved in concluding that the MMPI does not work be- 
cause single scale analysis fails.”’ 

Four other studies provide results on four different tests that are used in 
counseling. Shaw et al. (67) used the Minnesota Teacher Attitudes Inventory 
to measure changes in attitudes toward students and others produced by a 
two-weeks workshop. Drucker & Remmers (21) used the SRA Youth In- 
ventory to test differences between students judged ‘‘good” and ‘“‘poor” in 
school adjustment and found differences at the 1 per cent level between 
these groups in every area of the Inventory but two. Merrill & Heathers (45) 
provided another study of Wechsler-Bellevue subtests and found scatter to 
be such a common phenomenon among university students that any use of it 
in personality diagnosis seems highly inadvisable. Brogden (13) factor-an- 
alyzed the Allport-Vernon Study of Values and found that only four of the 
original scales were identified in this analysis: Aesthetic, Theoretical, Social, 
and Religious. The original Political and Economic scales become one second- 
ary factor. (This latter point was also an outcome of Lurie’s 1937 study but in 
that study the Aesthetic scale disappeared.) 

Three attempts to provide new measures of academic efficiency or study 
habits belong only in part under the heading of ‘‘personality measures.”’ 
Gough (28) developed a 38-item ‘‘achievement scale involving such personal- 
ity factors as self-confidence, purpose, and acceptance of others, as well as 
items on study habits,”’ that gave an r of .44 with high school grades on a 
cross validation population of 234 seniors. A scale of attitudes toward college 
was developed by Woodman (87) which revealed marked differences in atti- 
tudes between ‘“‘over’’ and ‘‘under” achievers. When the attitude items were 
weighted and the blanks scored, the two distributions (‘‘over’’ and ‘‘under”’ 
achievers) were independent for 68 per cent of cases, but with scores used 
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or prediction purposes he secured an r of only .30 between scores and first 
semester grades. Michael & Reeder (46) developed a 47-item inventory of 
study habits, which produced an r of .55 against high school scholastic aver- 
age, and, combined with the American Council on Education Psychological 
Examination, a multiple R of .83. These relatively high correlations dropped 
to an r of .31 and an R of .79 for high school girls, and to .32 and .56 respec- 
tively for college freshmen. No cross validation was reported. Here, as also 
suggested by some of the studies in interest measurement, are reported ef- 
forts to tease out attitude, motivation, and other nonintellectual factors that 
will contribute to the mystery of ‘‘Why do some students who are intellec- 
tually able achieve scholastically and others fail to do so?” 

Aptitude tests—Studies of the General Aptitude Test Battery are re- 
reported by Storrs (73) and by Jex & Sorensen (33) in which retest reliability 
of six of the tests ranged only from .70 to .88 but in which test G correlated 
highly (.80) with the Verbal scale of the Wechsler-Bellevue test in one study. 
In another study, however, test G correlated no higher with first quarter 
college grades than did test Q for women and test V for men. Follow-up 
reports on the Differential Aptitude Test are given by Bennett et al. (5) and 
by Wesman (86), but the data are reported in such general terms that exact- 
ness of interpretation is difficult. In general, college curricula chosen or jobs 
held seem to reflect part scores on the battery, but no tests of significance of 
the differences between scores are reported. Wesman points out that the 
validity in terms of part-score correlation with course grades is specific to 
each situation, i.e., no general validity conclusions can or should be drawn 
because of variation in the criterion (course content and grading standard). 

Prediction studies—Studies of scholastic prediction in the three pro- 
fessional schools of law, veterinary medicine, and medicine at the University 
of Minnesota are reported respectively by Berdie & Layton (7), Layton (39), 
and Schofield (63). Seven variables in Berdie and Layton’s study of law 
students produced a multiple R of .75 with freshman grades. These were the 
conventional intellectual variables, with prelegal grades the best single pre- 
dictor. Only one out of 46 students predicted to have grades of 80 or more 
failed to achieve them, while only 8 out of 43 predicted to have failing grades 
of 60 or less succeeded in passing. The prediction of success was better than 
prediction of failure, an unusual accomplishment. In the veterinary medicine 
study, Layton used only 3 variables out of 20 to produced an R of .60 with 
first quarter grades. As mentioned earlier one of these was the Veterinarian 
scale of the Strong Interest Blank. 

The part of the medical school study reported by Schofield in the current 
article deals only with the relationship of MMPI scores to junior year 
medical school grades. Students in the upper quarter as compared with stu- 
dents in the lower quarter of grades have significantly lower mean scores on 


‘ The title of this section is somewhat misleading for many of the studies grouped 
under test categories are primarily prediction studies. 
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scales 3 (hysteroid), 4 (psychopathic deviate), and 8 (schizoid). This finding 
was checked in various ways but held constant, which led Schofield to suggest 
that students who show both a restricted intellectual promise and marked 
elevations on these three scales will be likely to prove poor academic risks. 

A comparison study of high achieving and low achieving liberal arts 
students of high ability again stresses nonintellectual differences. Morgan 
(48) found that the high group in comparison with the low group had: (a) 
more Strong scores in Group V and fewer in Groups VIII and IX; (0) fewer 
MMPI elevations on Scale 4 (Pd) (this checks with Schofield’s finding), 
higher scores on special MMPI scales for Dominance, Social Responsibility, 
and Intellectual Efficiency; and (c) higher in the ‘“‘achievement motivation”’ 
scoring of the ThematicApperception Test (TAT). On the other end of the scale 
Wiegand (85) studied a group of 81 lower ability college entrants who had 
been placed in a special curriculum for the first year. All were presumably 
equally motivated to make high enough grades for transfer to a regular cur- 
riculum but the 41 who did so were different in several respects from those 
who had not made the required scholastic average. More of the ‘‘successes”’ 
(1 per cent level of confidence) had definite vocational choices, vocational 
choices that coincided with the ‘“‘most desired” occupation, and choices that 
were related to the college major chosen. Here we have one of the few studies 
that gives some hope to those who believe that the fact of vocational choice 
influences scholastic success. 

Lack of pressure from family as a favorable factor is attested to by Kirk 
(35). Puzzled by students who do well on objective tests of scholastic apti- 
tude but poorly on classroom tests, she postulates that academic failure of 
this type probably provides unconscious satisfaction for hostiiiiies directed 
toward some member of the family who demands success. She quotes a psy- 
chologist in the University of California Student Health Service: ‘‘On the 
basis of 1500 records collected here, the kinds of MMPI records most fre- 
quently associated with poor academic achievement are (a) Psychoneurosis 
with compulsive and depressive features, (b) Hysterical personalities.’’ This 
latter point checks with Schofield’s study. 

Biographical factors enter the picture in studies reported by Myers (49) 
and Berdie (6). The former tested biographical items selected from one fresh- 
man class in a women’s college as those that were related to achievement in a 
cross validation study using the succeeding freshman class. Items that dif- 
ferentiated between high and low achievement, at 1 and 5 per cent levels, 
were (in the direction of high achievement): Jewish family, from larger towns, 
with foreign born parents, youngest in family, low participation in high school 
activities but high in offices held. Berdie engaged in a follow-up of a large 
population of high school graduates one year after graduation to determine 
the outcome of plans stated during their senior year in high school. Students 
who were from homes high on the educational-vocational ladder were most 
likely to carry out their plans. The factor of economic determinism reared 
its ugly head again in the figures showing that 90 per cent of the high ability 
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children whose parents were in top-level occupations were able to attend 
college. The contrasting figure was 55 per cent for equally able students whose 
fathers were factory laborers. In general, factors favorable to plans for col- 
lege were these: (a) high economic and cultural level of home, (b) parents 
had attended college, (c) being a woman, (d) living in an urban area. There 
seems to be support here for the Warner-Havighurst-Davis school of thought. 

One study, of general usefulness, is that by Abelson (1) who used high 
school grades and scholastic aptitude tests to predict college grades for 3500 
students in seven coeducational institutions. He was able to state with some 
assurance that the greater predictability of women’s grades (often found in 
such studies, illustrated in this chapter by Wesman’s study) is not the result 
of predictor differences but is caused by the greater homogeneity of the cri- 
terion of women’s grades in college. 


PROCEDURES IN COUNSELING 


A sorting of the published material of the year that contribute to one’s 
knowledge of specific procedures in counseling aside from measurement and 
prediction, has resulted in the following grouping: (a) research studies in- 
volving a specific procedure or concept, (b) systematic but theoretical dis- 
cussions of a procedure, (c) some representative textbooks. 

In the first group the study by Richardson & Borow (55) examines the 
proposition that preparing students for the possible outcomes of counseling 
results in more effective counseling outcomes. Nineteen out of 25 criterion 
measures used to determine the effectiveness of the interview favored the 
experimental (prepared) group; six, significantly so. It seems clear that the 
prepared group got more from their interviews than the nonprepared group. 

There is encouragement in the outcomes of a program of counseling by 
Serene (65) in which he was able to motivate high school eleventh-grade 
underachievers to better academic performance. He counseled 124 under- 
achievers over the year regarding the realities of their past and potential ac- 
complishment with the result that correlation between aptitude and achieve- 
ment increased from .56 to .76; the number of pupils in the line of expectancy 
increased from 93 to 147 (out of 301 in class). No similar gains were found in 
the tenth and twelfth grades which served as controls. This study makes some 
contribution to self-concept theory. 

Somewhat more specific procedural elements are studies by Toman (81) 
and Berg (8). Toman had clients dictate their life histories after which the 
record was played back and each client analyzed the pauses he had made in 
the dictation. Such analysis was used as an introduction to therapy. Berg 
analyzed types of words used in a completed and recorded interview series 
to find that certain types of words (empathetic, negative) tended to increase 
as therapy progressed while other verbal correlates decreased (ego words 
and expletive sounds). Another attempt to isolate correlates of therapy prog- 
ress was the study by Dittmann (18) who found that a high level of coun- 
selor participation and counselor response to feeling or interpersonal be- 





XUM 








XUM 


COUNSELING METHODS 349 


havior were characteristic of the later stages of successful counseling. In 
the field of marriage counseling Mitchell reports a study (47) indicating that 
some of the factors in the initial interview have a closer identification than 
others with the ‘‘whole case” factors. This suggests that they are more pre- 
dictive and should be given more attention in the initial interview. 

In the group of theoretical discussions of counseling procedures Porter 
(54) defines five classes of interpretations made by the counselor. He devel- 
ops these as outgrowths of two broad types of counselor activities, interpret- 
ing the meaning of the client’s productions and expressing to the client the 
meaning that will best help him. A different type of process analysis is made 
by Beier (3) who defines three approaches that may be used with the in- 
voluntary client, the one “in whom resistance toward giving up symptoms 
and substitute gratifications is greater than his desire for help.” 

Brief discussions of specific procedures are found in studies on the use of 
reassurance by Laing (38), the individualizing of test interpretation by Kirk 
(36), reduction of anxiety in the testing situation by Sinick (70), and han- 
dling therapeutic overambition by Beigler (4). 

It is obviously impossible to characterize a textbook fairly in a single 
sentence and yet such is almost necessitated by the limits of this chapter. 
Soundly written, with stress on the dynamics of interviewing and with 
much practice and illustrative material, is the text for counselors and social 
workers by Fenlason (24). Another text is a thorough and clear analysis of 
the use of testsand many other procedures in studying students by Froehlich 
& Darley (25). Another text in a somewhat more general view and with a 
modified nondirective emphasis is that by Shostrom & Brammer (69) while 
the volume, Counseling College Students During the Defense Period (83), is 
specific in both purpose and the counseling procedures proposed. Hiltner’s 
follow-up (31) of his earlier book in pastoral counseling is a casebook contain- 
ing several chapters that will be suggestive to counselors in general; the 
chapters entitled ‘‘Action,” ‘‘Externals,’’ “Charm,” and “Hostility” in 
counseling are penetrating and of general psychological usefulness. 


VOCATIONAL CHOICE AND OCCUPATIONAL INFORMATION 


A monograph by Small (71) supplies some comments on Ginsberg’s 
theory of vocational choice which has caused considerable discussion since 
the appearance of his monograph in 1951. Small gathered data on fantasy 
and reality factors in the vocational choices of 144 boys aged 15 to 19 which 
do not support Ginsberg’s assumption of a linear relationship between age 
and realism. There may be a change from fantasy to realism or from realism 
to fantasy at each age level. The better adjusted boys were more realistic 
in their “first choice’ than the more poorly adjusted boys, but were less real- 
istic in their ‘“‘second choice.’ Small proposes that choice involves the inter- 
play of fantasy and reality, with reality perception being given full weight 
in the first choices of the better adjusted boys but fantasy given some satis- 
faction in the second choices. Kirk (37) reports the changes in vocational 















350 WRENN 


choice of 3100 veterans after the ‘‘brief counseling contacts’ of a Veterans 
Administration Guidance Center to be in the direction of current vocational 
trends, i.e., the change in choices seem to reflect realism. 

Strong (75) reports a significant finding from one of his numerous an- 
alyses of earlier and follow-up data. Choices made in the college sophomore 
year show a reliability of .80 with choices made in the freshman year, with 
measured interest reliabilities of from .88 to .91 for the same period. When 
choice and interest pattern were compared, however, it was found that the 
closer the relation between the two measured interest patterns the closer 
the relationship between the choices made in the two years. Samson & 
Stefflre (62) studied the relationship between parental occupation and the 
vocational choice of 1136 high school seniors in Los Angeles who had been 
exposed to 10 months of vocational counseling on a city wide basis. Use of 
chi-square indicated a clear relationship between parent’s occupation and 
child’s choice which they term a “significant secondary factor.”’ The primary 
tendency is still to choose a vocation at a higher prestige level than that of 
the parent. (Incidentally, Strong’s one-year follow-up showed no change in 
average prestige level of vocation chosen. This remained high, being 90 on 
the Stubbins’ scale of 1 to 99.) 

Another approach to a study of choice was made by Traphagen (82) who 
compared scores from the Strong Interest Blank for 30 veterans who had in- 
dicated a vocational choice of ‘‘high school teacher’ at the beginning of 
counseling but had changed their choice during counseling, with 30 who still 
maintained their precounseling choice of high school teacher. A primary 
pattern in Group V with A or B ratings on the Teacher scales discriminated 
in favor of those who retained their vocational choice of teacher. Primary 
patterns in Groups IX, X, and XI discriminated in favor of those who changed 
from their choice of teacher. 

The appearance of the second edition of Shartle’s basic book on occupa- 
tional information (66) will be welcomed by many. The revision contains 
several additions and changes such as the 1950 census classification data, the 
material of the revised Dictionary of Occupational Titles, and the North- 
Hatt Scale of Prestige Ratings. Kaplan (34) cites the availability of the P-B 
series of United States Census bulletins which give the numbers in each 
major occupational group by each county and each urban area in any state. 
The P-C series gives a more detailed occupational breakdown for states and 
cities. A special ‘‘national sample’’ report of all occupations is also available. 

Psychologists will be interested in at least one phase of Mahoney’s 
“‘expert-opinion” analysis (43) of the elements of occupational information 
that should be included in the professional education of school counselors. 
The area of greatest agreement between state supervisors, city directors, and 
university professors on the items that should be included was that of ‘‘voca- 
tional adjustment.” This area, having the largest number of items of any of 
the nine categories of the study (34), contains such topics as “personal 
characteristics that contribute to job holding and job security,” “relation- 
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ship of intelligence levels to vocational selection and vocational adjustment,” 
and “patterns of ability generally considered necessary for success in col- 
lege.’’ Germain et al. (26) presented their second report on the forms used by 
government and military agencies for categorizing physical characteristics 
used in job placement. The terminology of the physical capacity form and 
the physical demand (job) form should be common to both but is not. Only 


one U. S. agency, the United States Employment Service, shows any sub- 
stantial correlation. 


CONCLUSIONS 


The past year has seen the beginning acceptance of ‘‘counseling psychol- 
ogist’’ as the professional title at the Ph.D. level in this field without aban- 
donment of the term ‘“‘counselor.’’ There has also developed an improved un- 
derstanding of what is involved in professional training, in the research 
needed on the nonintellectual qualities presumed essential for practitioners, 
and in the application of a code of ethics to counseling. Professionally, this 
field of psychology is a little more nearly mature than it was a year ago. 
Knowledgewise and researchwise we have greatly benefited by a dozen stud- 
ies of major significance. A prediction is made by this writer that some of 
the past year’s writing on counseling and vocational choice theory, on 
research and evaluation methodology, on the Strong Interest Blank and on 
the MMPI will influence thinking and research on counseling for several 
years to come. 

It is common to have reviewers of research shake their heads and cluck 
their tongues sadly over the scarcity of ‘‘good”’ research. On the contrary, 
this reviewer is encouraged regarding the quality of much that he has read. 
Although there are still goodly amounts of shoddy research and ill proved 
claims there is enough also of that of which we can be proud to encourage us 
regarding future developments in this area. 
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EDUCATIONAL PSYCHOLOGY 


By JAMES MAXWELL 
Moray House, Edinburgh, Scotland 


INTRODUCTION 


The topics discussed in this review are derived from those publications, 
containing material of interest to educational psychologists, which have 
appeared during the period March, 1952 to March, 1953. It was not always 
possible to obtain in Britain the most recent issues of periodicals published 
in the United States in time for inclusion in this review; and it is probable 
that some contributions have been overlooked. The purpose of the review, 
however, is more to examine and evaluate the current trends in educational 
psychology than to comment critically on individual articles, so the omis- 
sion of some of the published material may not significantly disturb the 
general picture. 

No clear-cut definition of the term ‘“‘Educational Psychology” has been 
attempted; the scope of the review has been empirically determined by the 
selection of such material as would interest those directly or indirectly con- 
cerned with the process of education, within the limitations that the topics 
should be discussed from a psychological point of view, and that they should 
to some degree be an original contribution to the field of knowledge. 

Research and practice. —The relationship between research and educa- 
tional practice is discussed in several articles. Brownell (1) points out the 
value of pure as well as applied research, indicating that the results of re- 
search on rats may in the end have as much influence on education as 
research on children. He also thinks it is not a bad thing that researchers 
are not able to insist on their findings being applied immediately; a time lag 
is necessary for proper understanding and evaluation of research findings; 
and the results of research are influencing educational practice, though not 
as speedily as the research worker may wish. Also among the optimists are 
Gray & Iverson (2), who, in a discussion of the teacher’s attitude to criti- 
cisms, argue that the standards of attainment are at present at least as high 
as in the past, and that modern teaching methods are founded on the re- 
search results of a considerable number of years. It is worthy of note that 
both Gray and Brownell are in the fortunate position of having been con- 
cerned with research in the elementary school subjects for a long enough 
time to see the results of their work appearing in school practice. It is also 
noteworthy that both use Reading as the example which best supports their 
thesis. Others are not so sanguine. Leonard (3) stresses the change in em- 
phasis in education from skills to citizenship, and calls for further research 
in the nature and climate of the group, in the acquisition of moral values, 
and in children’s development. He accepts by implication the thesis of Gray 
and Brownell on the school subjects, but disregards their plea for time and 
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patience. Leonard’s plea for research in group relationships has been 
responded to in the period of this review, in quantity if not in quality, and 
some of the articles reviewed do give added force to the argument for a 
time lag for adequate evaluation of their results. Buswell (4) emphasises the 
difference between ‘‘Frontier’’ research and ‘‘Follow-up”’ research. Frontier 
research is exploratory, empirical, and less rigorous. ‘‘Follow-up’’ research 
is rigorous, carefully designed, and aimed at elucidating the theoretical im- 
plications of the frontier research. The distinction is a good one for the evalu- 
ation of research. Much of the research reviewed here is frontier research; 
it is exploratory, and often inconclusive and unconvincing, but to apply 
rigorous standards would be misleading, and for the findings to be put into 
practice might be disastrous. The follow-up research is less interesting, pos- 
sibly, but it is on that type of research that changes in educational practice 
are made. 

There does appear, however, to be a gap between the researcher and the 
teacher which could be closed. Gray & Iverson, in the article quoted, advo- 
cate some national establishment to co-ordinate research. Mackenzie (5) 
deplores the lack of co-ordination between studies in curriculum subjects 
and the work in schools, and suggests more team work in research with the 
participation of teachers. Corey (6) makes the same plea for greater partici- 
pation in research by serving teachers, and suggests methods by which 
teachers’ observation and records could meet the more exacting require- 
ments of the research worker. 

None of this is new, but it is good that these issues should be kept alive. 
It is heartening to find that experienced research workers are satisfied that 
research work in educational psychology is affecting school practice, though 
perhaps slowly. But both Buswell’s stages must be passed through before 
the findings are tested in the ultimate trying ground, the classroom. Closer 
liaison between teachers and research workers seems needed, but the fault 
is not wholly with the teacher. Many articles contributed by teachers have 
been excluded from this review because they were anecdotal, hortatory, and 
too specific; many research articles cited are open to the charge of being 
rather remote from classroom realities and frequently concerned with tests 
rather than children. Too often the educational purpose appears to be the 
acquisition of a higher degree rather than the extension of knowledge of 
children and education. In the end, however, time will prove to be a better 
evaluator than any reviewer. 


ScHOOL SUBJECTS 


Reading.—Reading still holds first place among the school subjects in- 
vestigated, though there has of recent years been a shift of emphasis from 
the elementary to the later stages of reading instruction. The complaint of 
Hill (7) that too little attention is given to research in higher education in 
the United States is not borne out in the field of reading. Half the articles 
cited below deal with higher education. 
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Elementary school reading.—It appears as if the subject of elementary) 
reading had been virtually exhausted. The frontier research in reading skills | 
was done many years ago, follow-up experiments have been extensive, and 
the outcome of all this investigation is presented in two very comprehensive) 
text books on reading by Anderson & Dearborn (8) and by Tinker (9). The 
title of Anderson & Dearborn’s book, The Psychology of Teaching Reading,| 
is a good indication of the treatment. Reading is treated initially as a func- 
tion of the child’s growth, but this approach is not systematically pursued 
through the book. Considerable attention is given to the perceptual aspects 
of learning to read, and it is here that the authors draw most upon the 
very large volume of research available. An excellent critical examination of 
methods of teaching reading is followed by a short discussion of evaluation. 
Though pedagogically useful, this book merits mention here as an example 
of the way in which a large and scattered body of research work can be 
brought together to enlighten and instruct the practising teacher. The title 
of Tinker’s book, Teaching Elementary Reading, is also an indication of its 
approach. The stress is on teaching methods, and more attention is paid to 
training than to maturation. A large portion of the book is devoted to reading 
readiness and Grade I methods. Preparatory activities and training pro- 
grammes are fully described, though little evidence is quoted in the text as 
to their effectiveness. The bibliography, though, is extensive. The book is 
apparently addressed to the serving teacher and suffers from the defects of 
most digests. It tries to be too comprehensive, forgetting sometimes that the 
field is governed more by the knowledge available than by the needs of the 
classroom. The result is a tendency to dogmatism and over-simplification, 
without adequate distinction being made between principles well-founded on 
research and those less well-founded. Though a bit diffuse, the treatment 
aims at covering the ground that the school teacher is concerned with, 
which it succeeds in doing comprehensively, if at times uncritically. 

The remaining studies are mainly concerned with the tidying up of the 
odd corners of the field of reading skills. Triggs (10) shows that word recog- 
nition, defined in terms of ability to match the printed and spoken word and 
to divide words into their syllables, does not appear to show improvement 
after Grade IV. It is probable that the essential skills have been mastered 
before that stage. Bond (11) and Smith (12) both raise doubts about the 
efficacy of present group methods of teaching reading. Both desire more 
individual instruction, which they maintain raises the standard of reading 
attainment, though neither explains how the teacher is to cope with largish 
classes. Haugh (13) investigates the effectiveness of radio listening as a sup- 
plement to reading. For 539 Grade XI pupils,! it was found that after seven 
weeks, a greater gain in information was made by the readers than by the 


1 We adopt the British usage of describing one still in attendance at school as a 
pupil, reserving the term ‘“‘student” for one attending a college, university, or other 
institution for further education. 
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listeners. The same applied to changes in attitude. Studies by Freeburne & 
Fleischer (14) and Hall (15) investigate the effects of background music on 
reading efficiency. Hall, with 278 Grade VIII and IX pupils, finds music 
helps reading. Freeburne & Fleischer, with 208 students, find no significant 
differences. Both studies overlook the probable incentive inherent in the 
experimental situation; and it does not appear likely that musical accompani- 
ment would be an aid to long-term serious reading. McKillop (16) tackles a 
more fundamental aspect of reading, the interaction between the reader’s 
emotional attitudes and the material read. She gave 512 pupils in Grade XI 
attitude tests of the Negro versus white variety and found that the pupils’ 
responses on matters of factual detail in their reading were not affected by 
attitude, but there were significant differences in their interpretation and 
evaluation. This study, conducted on sound lines, would indicate a need 
for teaching pupils to read accurately and without bias. Though no indica- 
tion of the differences in reading efficiency in relation to attitudes is given, 
it is possible that poor readers are more susceptible to emotional bias and 
to read into the passage what is not there. But it does seem that reading skill 
alone does not make a good reader. Burton (17) attempts to determine the 
nature of literary appreciation with Grade XII pupils. He finds it related to 
verbal intelligence, social background, and silent reading skill, but all cor- 
relations are rather low. There is the usual difficulty of the criterion; and 
though the results are unconvincing, appreciation has been a neglected aspect 
of reading studies. 

College reading.—T he remainder of the studies deal with the reading dif- 


ficulties of college students. Such students appear to be a growing problem 


to American psychologists, and, during the period under review, are the 
major concern of those working in the field of reading. Krisa (18) found that 
his 20 adults showed no reversals in reading, but continued to show them 
with artificial symbols. He concludes that reversals in reading are mastered 
by training and practice, and not by maturation as is often stated. The 
number of cases is too small and the experimental conditions too artificial 
for this conclusion to be wholly accepted. Manolakes (19) trained an experi- 
mental group of adults, using a tachistoscope. This training had no effect 
on their reading eye movements. It remains to be found whether this applies 
to children, as there are many advocates of perceptual training as preparation 
for learning to read. Witty, Stolartz & Cooper (20) gave 42 students, all 
low scorers on the Co-operative English Text, a nine week intensive training 
which raised their reading percentile rank from 48 to 76. The permanence of 
the improvement is not stated. Barbe (21), in a similar twelve week course, 
obtained a 64 per cent improvement in rate of reading, which dropped to 
49 per cent six months later. Mouly (22) similarly found improvement, but 
points out that the personality of the students electing remedial reading 
may be a factor in the results. Johnson (23), with small groups of students, 
found improvement in reading both from training in techniques and from 
group therapy. In all such studies, it must be remembered, the validity of 
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the findings depends on the validity of the measuring instruments, and it is 
all too easy to balance one set of test results against another and claim im- 
provement in reading as the result. Timely warnings are given by Carillo & 
Sheldon (24) who point out that flexibility of reading rate is more important 
than mere speed, which is so often taken as the criterion of reading efficiency. 
The appropriate measure should be the relationship of speed to difficulty. 
Barbe & Grilk (25) show that reading rate and intelligence correlate only 
r=.12, whereas if other reading abilities are considered as well, the correla- 
tion rises to r=.72. Finally Dahlberg, Rosewell & Chall (26) stress that read- 
ing efficiency is not merely a matter of skills; disabilities are often associated 
with psychological disorder, which must be taken into account in any scheme 
of remedial training. 

The reviewer remembers a well-known American psychologist asking 
an equally well-known Scottish psychologist what was done in Britain 
with students with reading disabilities. The answer was that we do not let 
them into college. The absence of any British contributions to the literature 
reviewed is an indication of the way in which educational research is in- 
fluenced by educational needs. From the safety of the other side of the 
Atlantic the reviewer may perhaps offer a few critical comments. The) 
presence of college students who cannot read effectively would seem to call | 
for greater attention to the school reading programme, particularly in | 
diagnosis of difficulties in the upper grades. There is little sign of such/ 
interest. Another doubt concerns the efficiency of intensive courses at a late} 
stage in reading progress. There are too many studies which take reading 
tests as their own criterion; more confidence would arise from proof that such 
courses resulted in improvement in college work, which Mouly uses, and in 
a longer term follow-up of the benefits gained. Such researches would be more 
difficult than those cited, but more valuable.? 

Other school subjects The other school subjects are poorly represented. 
In art, there are one or two studies of interest. Whipple (27) assessed the fea- 
tures of book illustration most interesting to Grade VI pupils. The favoured 
features were a definite centre of interest, action, colour, and size. Readers 
with many pictures were preferred to those with few. Two very neat investi- 


2 The reviewer received too late for inclusion in the text a report of the Annual 
Conference on Reading, edited by Gray (107). It is impossible to summarize such a 
comprehensive survey, consisting as it does of some 50 papers on various aspects of 
the topic. The opening paper confirms the view that the field of study of reading as 
such is virtually exhausted, but points out that a new field is now being explored, the 
relationship of reading skills to other areas of the school curriculum. It is to the latter 
topic that the volume is devoted, and the discussion is thorough. Throughout, the re- 
quirements of the high school are kept in mind, and in this respect, as in others, the 
volume goes some way towards meeting the reviewer's criticisms about neglect of cer- 
tain aspects of the study of reading. The contributors ask questions rather than at- 
tempt to answer them, and it is likely that further research in reading will arise from 
the questions they ask. 
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gations by Rudisill (28) and French (29) discuss pupils’ development in 
appreciation. Rudisill, using a variety of coloured and uncoloured pictures, 
finds that realism rather than colour appeals to children from kindergarten 
to Grade VI. The conclusion is that adults over-emphasise the attraction of 
colour for children. French finds that young children are quite consistent in 
their judgements of pictures. They prefer clear outline and isolated objects, 
similar to their own productions. Adults and pupils of Grade V upwards 
prefer impressionistic pictures, with true perspective. There is a fairly clear 
maturational change both in drawings and appreciation, and the findings 
are well worth study by teachers endeavouring to train children in art. The 
remaining studies are slight. Elkisch (30), in a subjective analysis of Grade V 
and VI boys’ drawings, concludes that they think of themselves as machines. 

, Knopp & Richards (31) find girls’ drawings more complex than boys’, and 
younger children’s more constricted. The main interest in these investiga- 
tions is in appreciation, and studies like those of French and Rudisill are 
contributions to the understanding of children’s development which could 
profitably be carried further. 

An over-elaborate factor analysis of a spelling test of 100 words by 
Knoell & Harris (32) does not leave us much wiser. Wiliard (33) finds 19 
high school pupils given formal spelling instruction were 18 per cent better 
than a control group, which is fair enough. Dressel, Schmidt & Kincaid (34) 
reveal a state of affairs among college students which requires attention. 
Excluding English, the hours per year spent in writing varied from 131 to 
4. It was found also that frequency of essay writing in subjects other than 
English gave rise to no improvement over a year. 

The work on arithmetic is disappointingly small. As a skill, arithmetic is 
as essential as reading, and yet hardly a single article is found dealing with 
the learning and teaching of arithmetic. Crawford & Zylstra (35) show that 
the level of accuracy with which high school pupils can estimate quantities 
is low, and that ability to estimate size, volume, etc., is independent of IQ 
and mathematical ability. The other contributions deal with the writing of 
numbers. Perry (36) points out the superiority of arabic over roman numer- 
als, and Wright (37) in an extensive discussion, examines the sources of con- 
fusions in numbering and the current practices in teaching. From these he 
derives the optimal system of numbers, including the small zero and round- 
topped three. The book is voluminous but readable, though the validity of 
the proposed system is not fully established. Halloran (38) shows that for a 
four-year period, Esperanto is a useful introduction to the study of French 
for poorer pupils, but is of no benefit to the abler ones. The four-year span 
of the experiment gives confidence in his findings. 

Measuring progress—The theoretical implications of measurement of 
educational progress have produced some thoughtful papers. English (39) 
discusses the concept of learning and maintains that there is no need for a 
systematic theory; learning is a technological term, and is established 
by an empirical measurement of improvement. Without necessarily adopting 
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this extreme empirical attitude, Durost & Prescott (40) stress the need for 
careful consideration of what is involved in measuring improvement. Units 
in both measurements should be comparable and equally variable at all 
levels, and there should be equal opportunity for improvement for all sub- 
jects, differences in curriculum, length of school year and so on, being taken 
into account. There is rather a counsel of perfection, but more attention to 
these requirements would produce more valid results. Tilton (41) also dis- 
cusses the question of a general learning ability, basing his findings on the 
intercorrelations of school subjects after a 20-month interval through Grades 
IV and V. He very properly emphasises that measures of improvement must 
be reliable as well as the tests used, and tentatively concludes that there is 
a general learning ability. This is an initial approach to a topic which calls 
for further research in two dimensions, the constancy of growth of the general 
learning ability and the maturational development of specific abilities. The 
theoretical implications are important, but the technique of investigation will 
require careful consideration. 

The year’s work in the school subjects has, with a few exceptions, been 
disappointing. Interest in this field seems to be stagnating. This is explicable 
in reading, where a large mass of research has still to be digested, but the 
drift of interest in the United States to evaluation and social relationships 
and in Britain to measurement and selection, does not wholly excuse the 
paucity of contributions to the scientific psychological study of the basic 
school subjects. The lack of interest in arithmetic is especially disturbing. 


SocraL ASPECTS 


Under this heading are such topics as interpersonal relations, home back- 
ground and interests, and attitudes to social questions. These topics are 
becoming of increasing interest to educational psychologists and reflect a 
shift of attention from the teaching of skills to the teaching of citizenship 
and how to live. Greater attention has always been paid in the United States 
to personal and group relations than in Britain, as the problems of American 
society are more manifest and urgent. The feature of the present period is 
an outburst of sociometry. 

This is still frontier research and much is exploratory and descriptive. 
Techniques are yet to be established, and the findings are far from conclusive, 
even at times appearing rather pointless. But implicit in it all is a felt need 
for adjusting the individual to a social group. The “‘isolate”’ is by implication 
requiring rescue, the ‘‘star’’ is approved. What will come out of it when sound 
working techniques have been established is not yet clear, but the positive 
purposes to which these techniques will be applied require watching. The 
pupil of independent mind and character may fare badly. 

Sociometric studies—Wrightstone et al. (41) use sociometric techniques 
to assess and improve school staff relationships. The claim is made, despite 
inadequate statistical support, that service on committees increases the 
acceptability of isolate teachers. Kuhlen & Collister (42) find the social 
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acceptability of Grade VI and IX pupils who leave school is lower than that 
of those who remain. The social acceptance is here clearly specific to the situa- 
tion, and cause and effect are not distinguishable. Holmes (43) shows that 
abler students have a better attitude towards the worth of others. Again a 
normative implication is present. Ausubel, Schiff & Gasser (44) investigate 
pupils’ awareness of their own and others’ sociometric status. Using four 
methods of assessment, mainly five point ratings, they find ability to assess 
status increases with age. Their ratings are not shown to be very reliable, and 
their conclusions are more sweeping than their evidence warrants. The 
Alexanders (45) ask whether socially acceptable children are best adjusted. 
Joe, who was most frequently chosen by a group of nine-year-olds, was clini- 
cally examined to see if he was well adjusted. He was not. Bretsch (46) sepa- 
rates 696 fourteen-year-olds into well, medium, and poorly accepted groups. 
There is a significant difference between the groups in respect of social skills 
and activities in favour of the well accepted, but there is no check upon the 
validity of the social skills ratings, and the differences are very small. The 
two Graces (47) find no relationship between verbal behaviour and the opin- 
ion of the rest of the group about how an individual would actually behave. 
The criterion is rather artificial; the article is full of jargon. Goodnow & 
Taguiri (48) use sociometry (choice of roommate) to find that boys tend to 
choose mates of their own religious denomination. They expect also to be 
chosen on the same principle. Hallworth (49) gives what is essentially a de- 
scriptive account of the personal relationships in a grammar school of pupils 
aged 11 to 16. Under the ambitious title of Studies in the Social Psychology 
of Adolescence (50) and under the editorship of C. M. Fleming are published 
four studies of social relationships in school. Richardson (50) groups a class of 
girls according to their sociometric choices, and claims improvement in 
English and social relationships in comparison with a control class. There is 
too much description and the evidence is essentially subjective. Forrester 
(50), in an inconclusive study of attitudes of adolescents to their own de- 
velopment uses questionnaire methods, but without check on their validity. 
The adolescents are concerned mainly with adventure and with growing 
up, but these findings may be as much a function of the questionnaire as 
the pupils. Shukla (50), also using unvalidated questionnaires, finds friends 
have common interests. Higginbotham (50) records discussions by Grammar 
School and Approved School girls, but little else. Taylor (51) used sociometric 
techniques in an attack on a more vital problem, the desirability of promotion 
by age in elementary schools. Over 1000 children in 38 classes were studied, 
and it was found that there was no relation between age and acceptability, 
this not justifying age promotion. A slightly lower constancy of social 
acceptance was found in progressive as opposed to traditional classes, which 
was taken to mean that teacher’s efforts at social adjustment were not wholly 
ineffective. This is a more competent research than most quoted in this sec- 
tion; but what they all add up to is not clear. Though strict standards should 
not be applied to exploratory research, yet some regard must be had for 
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scientific logic and relevance to educational problems. Though the chalk 
legends on school walls that AB loves CD have not yet attained the status 
of educational research, some of the contributions cited above, especially 
the British, are only about one stage beyond this. 

There are attempts, however, to bring order into the field. Moreno (52) 
objects to the extensive use of sociometric techniques without relation to the 
theory behind them. There is inadequate analysis of the group situation, 
ignoring of the observer-actor relationship and gross oversimplification of 
experimental design. Clark & McGuire (53, 54) jointly attempt to derive 
working indices from sociometric data. Assuming that a sociometric valua- 
tion gives information about the valuer as well as the person valued, a ‘‘socio- 
graph’”’ can be made showing social clusters and cleavages. Similarly they 
find two indices of peer status. The analysis is thorough and it may prove 
that the indices are practicable measures like the well-established IQ, but 
there does remain the doubt that social relationships are less constant than 
IQ is assumed to be and the data not susceptible to a single measure. Still, 
until something like these indices can be used, it is unlikely that anything 
of permanent value will emerge from the current sociometric studies. Wityrol 
& Thompson (55) compare the stability of two sociometric measures, Partial 
Rank Order and Paired Comparison, on Grade VI pupils over five weeks. 
Paired Comparison is both more rigorous and reliable, and the two methods 
agree with a median correlation of r=.7. This is the kind of investigation 
that should have been done before sociometric techniques were applied as 
freely as they have been. Whether sociometric techniques will prove a fruit- 
ful instrument of research remains to be seen, but much progress is unlikely 
till there are more investigations of the kind of the last two cited. 

Attitude studies—The attitude studies produced during the period under 
review are in continuation of those of the past few years and present no un- 
usual features, though they add to the fund of information. Estvan (56) 
makes a comprehensive study of the attitudes of eleven-year-old children to 
poverty. Distinctions between social class are shown in awareness of the 
effects and incidence of poverty, the lower class children having a more sym- 
pathetic, immediate, personal awareness. Children of high intelligence have 
greater recognition of poverty as a social as well as personal affair. Boys tend 
to look at poverty from an objective and father’s point of view, the girls 
more at the family and home aspect. The feature of this study is the care 
taken in experimental design, which lends confidence to somewhat subjective 
methods of interpretation. This research has a direct bearing on the teach- 
ing of social studies in school. Himmelweit, Halsey & Oppenheim (57) in- 
vestigate differences between grammar school (IQ, 115 plus) and modern 
school boys in their attitudes to social class. Sixty per cent of the boys did 
not understand what social class meant, and the remainder defined it in 
several frames of reference, e.g., work, income, politics, etc. They did, how- 
ever, agree with the sociologists in ranking occupations, though working- 
class boys overranked manual professions. The most significant finding was 
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that most boys expected their occupations to be higher than their fathers’. 
This study would suggest that the attitude of most boys to the social struc- 
ture is not far removed from that of the community in general. Prothero & 
Miles (58) take a representative group of adults, and find that their attitudes 
to race and religion are very similar to those of college students. Young, 
Mayans & Corman (59) find that 83 per cent of their Grade XI pupils have 
the same political views as their parents and the same complex of attitudes 
associated with these politics. All these studies raise the question of what 
effects school education has on the social attitudes of pupils. It appears, very 
little. 

Children’s interests—There are also a few studies of the effects of chil- 
dren’s interests outside school. Malter (60) analyses children’s comics, find- 
ing only about 5 per cent of the space is used for printed matter, and that 
humour and crime between them account for half the contents. Crime is 
not so predominant as it is alleged to be. McKellar & Harris (61) find Brit- 
ish children aged 8 to 14 years uniformly prefer radio to cinema programmes 
and prefer humorous variety and adventure serials. They find the children 
not uncritical of crime and adventure features, and like Malter, they conclude 
that the effects of such stories can easily be exaggerated. Another commonly 
held belief is deflated by Smith (62). In an experiment using control and ex- 
perimental groups, it was found that instruction by films led to gains by the 
more intelligent pupils, there being no evidence of gain by the less intelligent. 
Though the methods used were rather cumbersome, the conclusion appears 
sound enough. 


GUIDANCE AND SELECTION 


Educational psychologists in both Britain and the United States are 
faced with what is fundamentally the same problem, though the settings 
are different. The problem is to reconcile individual differences in ability and 
circumstances with an egalitarian educational policy. In the United States, 
where the students’ choice is freer thanin Britain, students are liable to elect 
courses for which they are not suited and become academic misfits. In Brit- 
ain the pupils are allocated to different types of secondary education and 
again misfits occur. In Britain the psychologists have to talk themselves out 
of misfitting; in the United States the psychologist has to talk the student 
out of it. In both countries there is constant need of valid and reliable assess- 
ment of an individual’s suitability for various courses of study. 

Prediction of academic success——Most of these studies are American and 
are concerned with college students. A review of the practice in many colleges 
is given by Cosand (63). Intelligence tests plus language-skill tests give pre- 
diction up to r=.8, though no single test appears to give more than r=.5; 
and high school marks appear to be better predictors than any single test. 
A comprehensive table of results from various sources is given. Bolton (64) 
reports that, of a battery of intelligence and scholastic tests, the best pre- 
dictor was the American Council of Education Psychological Examination 
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(A.C.E.), but that its prediction of college progress was considerably higher 
for the first than for the second year. This well-conducted study confirms 
previous findings. Similar results are given by Frederiksen & Schrader (65), 
who report that high school marks are slightly better than A.C.E. test 
scores. Doppelt & Wesman (66) investigate the use of Differential Aptitude 
Tests rather than general intelligence tests as predictors of academic progress 
in school, and find that these tests give a correlation of r=.7 with the Iowa 
Tests of Educational Development one year later. This study has the weak- 
ness of many such, balancing one set of test results against another, but the 
authors are aware of the weakness of their criterion. Sutherland (67) com- 
pares the predictive value of different types of arithmetic tests given to 
twelve-year-olds with their school marks two to three years later. Problem 
arithmetic is found to be the best predictor, but not so good as an intelli- 
gence test. Differences in time allowance for the tests made little difference. 

Validity of intelligence tests—In all these studies, the conventional in- 
telligence test still appears the best predictor. In Britain, the use of intelli- 
gence tests for selection of pupils for secondary education has recently been 
criticised. In England particularly, the accuracy of selection is important, 
as the many pupils’ chances of obtaining an academic secondary education 
largely depend on their performance on such tests. Vernon (68), giving the 
question of coaching and practice effects on IQ an airing, reports gains of 
about 14 points in IQ from coaching and lesser gains from practice, but the 
investigation appears to have been done in circumstances giving the maxi- 
mum effects, and the conversion of scores to IQ is suspect. Wiseman (68) 
reports an extensive experiment in which a control group gained about four 
points IQ, a coached group six points, and a practice group, eleven points. 
This reverses Vernon’s results, and Wiseman’s groups were both larger and 
more representative. Watts, Pidgeon & Yates (69) find that practice gives 
slightly greater gains than coaching, and their results agree fairly closely 
with Wiseman’s. The upshot appears to be that IQ on verbal group tests is 
raised by about seven points by practice and coaching, but that the greatest 
effect occurs after the first practice or coaching period; and if a practice test 
is given shortly before the test proper, substantial justice would be done. 
Emmett (70), justifying intelligence tests for selection, claims that less than 
2 per cent of the children are affected by coaching, but admits that about 9 
per cent of the grammar school entrants are affected. He also ignores the 
resulting lack of confidence in the larger proportion of the scores from which 
the suspect cases are drawn. He also reports a correlation of r=.84 between 
selection tests and school certificate examinations as evidence of the efficiency 
of the tests. But the criterion is not how well pupils perform in the grammar 
school; it is whether the tests have selected the best pupils for the grammar 
school. More relevant validation is given by a report on Edinburgh pupils 
(71). The Leaving Certificate successes are compared with test results of six 
years earlier, and it is clearly shown that the chances of any pupil, not se- 
lected for full secondary education, gaining this Certificate are extremely 
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small. This workmanlike study, together with those of Wiseman and the Na- 
tional Foundation (69) are sound contributions to this very important educa- 
tional problem in Britain. These studies are unlikely to lead to any radical 
changes in practice, but it is all to the good that current procedures should 
be examined, exaggerated claims exposed, and sound practices confirmed. 
The use of the IQ in educational practice has been under examination 
from other aspects. Estes (72) finds significant differences between the aver- 
age intelligence of upper and lower economic groups in Grade II, but no cor- 
responding difference in Grade V. It is not clear, however, whether this is 
attributable to the levelling effect of common education, or the retardation 
of less able pupils. With three-year-old children, Sacks (73) reported an in- 
crease in Binet IQ when favourable social relations were established with 
the children. Boger (74) found training with jigsaw and other geometric 
puzzles increased average IQ, and Pitts (75) showed the importance of read- 
ing skill in the intelligence test scores of negro girls. The larger question of 
intelligence tests and cultural environment is discussed by Stenquist (76) and 
Biesheuvel (77). The former doubts the value of a “‘culture free’’ IQ, which 
would lose the cultural heritage of the schools and advocates the use of spe- 
cific aptitude tests instead of a general composite IQ. The latter reaches the 
same conclusion through difficulties in standardising tests on a bilingual pop- 
ulation, who are to be educated in a common school system. It does not seem 
likely, though, that intelligence tests will fall into disuse, as aptitude tests are 
specific to a given field of behaviour, whereas in education the aptitudes re- 
quired are so many and various that the overall IQ is likely to remain the 
most practicable and efficient instrument. Periodic reminders of the limita- 
tions of IQ are not amiss, and the comments of both writers are pertinent 
School success, however, does not depend on ability only; social differ- 
ences also operate. Halsey & Gardner (78) find greater differences in social 
class between grammar and modern school children in England than can be 
accounted for by differential selection by intelligence. Even within the 
grammar schools, the children from the middle classes have better prospects 
of success than children of the same measured intelligence from the working 


, classes. Campbell (79) examined the home conditions of pupils who had been 


selected by test for different levels of secondary education and were found to 


| be misplaced. Those who failed to make progress were found to have poorer 


home background than those who were progressing better than expected. 
His conclusions are cautious and sound. These two studies throw further 
light on the factors influencing school progress, and it appears that the influ- 
ence of cultural and social differences in measured intelligence is not disad- 
vantageous in educational selection, but in fact is underestimated in intelli- 
gence test scores. The study of social differences in education is developing 
slowly, probably because of the egalitarian basis of educational policy, but 
the principle that educational progress is not a function of intellectual ability 
only requires both emphasis and further study. It is possible to be as scien- 
tific about social differences as about cognitive differences, but it is less popu- 
lar. 
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TEACHERS 


The characteristics of the teacher, no less than those of the pupils, have 
an important bearing on education. Educational psychologists have always | 
paid more attention to the learner than to the teacher; and it is gratifying | 
that the teacher as a person, as distinct from an applier of methods, is attract- 
ing some attention. The problems of the assessment of the teacher’s influ- ( 
ence on the pupils, the teacher’s attitudes and technical competence, and in| 
the selection of fit persons as teachers are very considerable, and have not | 
yet received the intensive study they deserve. Most of those engaged in| 
training teachers are aware of the subjective and arbitrary nature of their 
judgements, and any critique founded on sound psychological investigation 
is very welcome. 

Teaching competence——The main difficulty lies in the criteria of teaching 
competence. Orleans et al. (80) point out that the ultimate criterion is change 
in the pupil, but this must be considered in relation to at least three essential 
variables, the kind of teacher, the kind of pupil, and the kind of educational 
aim. Assessment is not a simple matter, and basic thinking on the subject is 
much needed. In their study of teacher competence, Grim & Hoyt (81) show 
awareness of the complex criterion by approaching the assessment through 
both the teacher and the pupil. They obtain both the pupils’ and the princi- 
pals’ points of view on certain topics and propose to compare the results 
with the rating value given to the teacher. Validation is awaited, but the 
approach is promising. Ryans (82) endeavours to unravel the complexities 
of the criterion by a factor analysis of a classroom observation scale, but the 
five factors obtained do not add much to the understanding of the problem. 
The identification of factors is rather naive. 

Teachers’ attitudes—A more thorough investigation into the various 
components of successful teaching is needed before the training, selection, 
and assessment of teachers can be placed on a scientifically validated basis. | 
The question of teachers’ attitudes has, however, attracted a few investi- 
gators. Ringness (83) examines the reasons students give for choosing teach- 
ing as a career; these reasons can be subsumed under 13 headings. A com- 
parison of two methods of assessing these reasons, however, failed to reach 
sufficient agreement to confirm their reliability. The investigation is incon- 
clusive, but seems worth repeating with larger numbers and a careful follow- 
up. Evans (84, 85) also reports negative conclusions in a study of the rela- 
tionship of favourable attitude to teaching to intelligence and socioeconomic 
status. With students in training as teachers, no correspondence was found 
between attitude and teaching marks, or between teaching marks and intelli- 
gence, though a small negative correlation was found between attitude and 
intelligence. Possibly this last is a reflection on training college courses, but 
the main conclusion seems to be that a favourable attitude to teaching is in 
itself an independent component. As college teaching marks cannot be taken 
as a measure of teaching competence, the connection between attitudes and 
successful teaching remains unknown. Shaw et al. (86) and Cook & Hoyt (87) 
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investigate the Minnesota Teacher Attitude Inventory. Shaw finds an im- 
provement in attitude of teachers after a two-week refresher course, but this 
is hardly validation. Cook & Hoyt find that attitudes appear to be independ- 
ent of length of teaching experience, but that there are variations according 
to type of school, size of school system, and subjects taught. If this Inven- 
tory is to be related to teaching competence, it appears as if several valida- 
tions will be needed, and the complexity may be sufficient to render the in- 
strument impracticable. Bach (88), using both a comprehensive and thorough 
battery of predictors and criteria, finds reasonable agreement within the pre- 
dictor variables and within the criterion variables, but virtually no correla- 
tion between them. Practice teaching, done here under unfavourable condi- 
tions admittedly, is in particular a poor predictor. Shaw (86) taking the 
rather weak criterion of supervisors’ assessments during students’ teaching 
practice, claims the judgement of a single interviewer to give the most reli- 
able and valid prediction. But in view of Bach’s findings, the application of 
this to teacher selection is more than doubtful. 

The feature of all these studies in teacher assessment is their negative 


‘conclusions. No positive pointer has emerged, and such studies must cause 


serious misgivings in the minds of those who train and select teachers. The 
investigations quoted are by no means superficial or incompetent, yet no 
foundation for teacher assessment and selection has even begun to be laid. 
This field of educational psychology is one which is crying out for systematic 


.and concerted investigation. The pupil is only one part of the educational 


situation, the teacher is another, and vital one too. 


PROBLEM CHILDREN 


Under this head are included studies of children who present special edu- 
cational problems as a result of exceptionally high or low intelligence, sensory 
defects, behaviour disorders, or differences in race or language. Such children 
have for long been a standing concern of educational psychologists, and this 
year’s contributions are in line with those of the past; no new ground is 
broken, but understanding of these complex topics is increased. 

Intelligence——The education of the mentally defective child was one of 
the earliest studies of educational psychologists, and a fairly satisfactory 
level of practice now seems to have been attained. The papers cited below 
are contributions on a broad basis to the assessment of mental defect. Por- 
teus & Corbett (89) examine the various statutory definitions of feeble- 
mindedness in the United States and find for children that educational, 
psychometric and psychiatric criteria are confused. They stress the need for 
a clear legal definition which could be uniformly applied, which would cer- 
tainly help educators in knowing what the special needs of these children 
were. It appears that a child may at present be certified in one state but not 


‘in another. Fraser Roberts & Mellone (90) find that the variance of IQ in 


the Terman-Merrill revision of the Binet test is not independent of age, as 
it should be; therefore extremely high or low IQ’s are not comparable from 
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age to age. A table of corrections to give uniformity of variance is provided, 
and some of the corrections are large. A child aged six, with tested IQ of 
70, has three points deducted, a child aged eleven with the same tested IQ 
has six points added. The importance of this for the diagnosis of mental 
defect among children is obvious. 

The education of the highly intelligent pupil is no longer being as neg- 
lected as it used to be. Wilson (91) finds that the artistic, musical, scientific, 
and mechanical abilities of gifted pupils are above average, but that there 
is a wide range of variation and that special abilities are not highly corre- 
lated with IQ. This raises the question of the kind of total education suited 
to those children, and the question of what abilities are being measured when 
the children are designated gifted. Blackmer (92) and Thomas (93) discuss 
the gifted student and the transition from school to college. Blackmer ad- 
vocates linking certain colleges and schools to avoid retardation, and Thomas 
finds that such students present no difficulties in the academic or social sides 
of college, despite their youth and not infrequent poor home background. 

Sensory defect—The pupil with sensory defect remairs a problem, and 
it is perhaps proper that the child with probably the greatest disability, the 
deaf child, should receive mention. Smith (94) finds deaf children of average 
intelligence in nonverbal tests, but very retarded on verbal tests, which is 
not unexpected. Albright (95) finds almost three-quarters of deaf children 
and adolescents are below average on the California Mental Health Analysis. 
These findings depend entirely on the validity of this Analysis, which is 
very doubtful. Neither of these studies of deafness can be called a major 
contribution. Barger (96) discusses the reading difficulties of congenitally 
aphasic children, most of them being boys. A mirror reading technique is 
found to be effective, but the important point in the article is the suggestion 
that many reading difficulties may be a result of mild undiagnosed aphasia. 

Delinquency.—This is another field in which interest seems to be waning. | 
The problem remains, but it is possible that attention has shifted to clinical 
treatment of individual cases, a topic which is not included in this review. 
The average intelligence of delinquent girls on the Wechsler-Bellevue —y 
gence Scale is compared with that of nondelinquent girls by Diller (97). | 
The finding is the usual one, but the delinquents are found relatively better | 
in performance than in verbal test items. England (98), making a distinction! 
between delinquent and problem children, finds the latter most distinctive 
in their use of colour in drawings, this being very slapdash and careless. The 
criterion seems to be subjective, and it is doubtful if the results have any 
practical application. Another aspect is discussed by Crane (99). Over three- 
quarters of a group of student teachers report having been members of gangs, 
and the boys’ activities were more antisocial than the girls’. These activities 
did not necessarily denote prospective delinquency, as the students had 
grown up into well-behaved adults, by Australian standards at least. Crane 
suggests that much minor delinquency may be only a necessary phase in the 
process of social development and is temporary. This sounds reasonable, 
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but it does raise problems in the field of prevention of delinquency in its 
educational aspects. 

Race and language.—Differences in race and language from the norm of 
the community will always be an educational problem. It is perennial in 
the United States, at least so long as a democratic concept of education co- 
exists with what attitude studies show to be a fairly prevalent resistance to 
certain racial groups. The question is, of course, much more than an educa- 
tional one, but psychological examination can at least help to establish facts 
and indicate certain lines of amelioration. Lee (100) demonstrates how the 
process of acculturation raises difficulties for Chinese children. Their accept- 
ance of American standards leads to a cleavage from the parents, and the dis- 
grace of appearance in court seems to intensify the cleavage. Boyd (101) finds 
that in a mixed school for Negroes and whites, the level aimed at by Negroes 
is the higher; this is another aspect of the influence on education of the felt 
inferiority of certain groups. Billingualism is linked to race differences, but 
it appears from a study by Fishman (102) that where racial absorption has 
occurred, bilingualism is of minor importance. In a Yiddish school, Fishman 
found no difference between the play and leisure activities of unilingual and 
bilingual children. A study by Anastasi & D’Angelo (103) suggests that lin- 
guistic and intellectual differences between whites and Negroes may be 
cultural in origin. No significant racial difference was found for preschool 
children in either language tests or Goodenough’s Draw-A-Man. The groups 
were carefully selected, and the procedure carefully planned, but the author’s 
impression was that the white children were more mature linguistically. A 
further study by Anastasi & Cordova (104) of bilingual Negro children found 
that it made little difference whether a nonverbal test was administered in 
English or in Spanish. The group was below average, but this is not necessar- 
ily the result of bilingualism as such. Johnson (105) concludes that neither 
verbal nor nonverbal intelligence tests are valid for bilingual children. This 
does not agree with Darcy’s (106) summing up of research findings. Most 
studies seem to show a more marked inferiority of bilingual subjects in verbal 
tests. The question must be left open, but as long as intelligence tests play 
a part in the educational system, the problem remains one of practical im- 
portance, and full information is vital. It may be, as some contend, intelli- 
gence tests are unsuitable instruments for the purpose, or it may also be 
that the tests are in fact reflecting cultural differences associated with bi- 
lingualism. The studies quoted above are suggestive, but not final. 

The whole problem of racial and cultural differences is one which psycho- 
logical research will in itself not solve. It is, nevertheless, desirable that edu- 
cational psychologists should continue to be aware of the psychological 
aspects of this question and continue to add to the common fund of educa- 
tional knowledge, for along whatever lines a solution is reached, the educator 
will be playing a vital part. The studies cited tackle competently specific 
aspects of the larger problem, and though none is in itself conclusive, a steady 
stream of such studies cannot but be beneficial. Time and knowledge are 
good solvents of prejudice. 
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CONCLUSION 


Though the scope of this review has necessarily been limited by con- 
siderations of space and the fact that a number of topics of interest to edu- 
cational psychologists are reviewed elsewhere in this volume, certain trends 
of interest are evident. There are some perennial major currents of interest, 
e.g., the learning of elementary school subjects like reading or the measure- 
ment of abilities; there are other lesser and more fluctuating currents deter- 
mined by changes in educational needs and policies, such as reading of college 
students or coaching in intelligence tests; and there are certain more ephem- 
eral currents, such as the prevailing indiscriminate use of sociometric 
techniques in researches for which these techniques are not always fitted. 
The individual contributions are, as is to be expected, a mixed bag. Most of 
those in the well established fields of investigation are of sound workmanship, 
but in more recently opened fields there is more trivial and poorly designed 
work than is really necessary. Research, the reviewer believes, should have 
reference to some general educational issue, either practical or theoretical. 
Rather too many investigations begin and end in themselves, and the need! 


for a clear and relevant criterion for any tests, procedures, or groups investi | 


gated cannot be overstressed. The point is well illustrated in the selection of 
teachers, where the investigators show themselves particularly aware of the 
need for a well founded criterion before further investigation is possible. 

Perhaps as significant as the centres of interest are the centres of neglect. 
Reading attracts considerable attention, Arithmetic is virtually neglected. 
Reading of college students is the subject of several articles, reading of high 
school pupils inspires hardly any. The interpersonal preferences and popu- 
larities of pupils and students are extensively described, their relationship 
to the personal development and qualities of the individual is almost passed 
over. Much attention is paid to the effects of practice and coaching on selec- 
tion tests for pupils, much less to the social and economic factors which ap- 
pear to play a considerable part in their school progress. It would be Utopian 
to demand that all articles conform to rigorous standards of research design 
and execution, and that all topics receive their due attention; but, in the 
period under review, it is felt that more could have been done to achieve this 
standard and this balance. It is to be hoped that future reviewers in this 
field may find evidence which will enable them to disagree with this verdict. 


f 
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STATISTICAL THEORY AND 
RESEARCH DESIGN! 


By HuBeErt E. BROGDEN 
Personnel Research Branch, AGO, Department of the Army, Washington, D. C. 


No very definite or novel trends seemed noticeable in the literature on 
statistical methods and research designs during the period covered by this 
review. Increasing specialization seemed pretty generally evident. Although; 
few publications have appeared on the topic, increased activity in applica- 
tions of statistical problems to high speed computers suggests that such} 
devices will soon have marked influence in this field. Foreseeable effects in-/ 
clude the tabling of functions hitherto regarded as unmanageable, and the; 
use of more exact but computationally difficult methods, such as principal 
component solutions in factor analysis. ; 

An overview of the literature having general interest to psychologists 
led this author to the conclusion that some of the more significant advances 
were achieved in the areas of test theory and personnel classification meth- 
ods. As a consequence, these areas were accorded special emphasis. It is pos- 
sible, of course, that the writer’s area of specialization may have been respon- 
sible for such emphasis. 


TEst THEORY 


Lord (39) sets about determining the effect of item difficulty distribu- 
tions on certain relations between test scores and a perfect measure of their 
underlying common ability. He investigates this for the case where multiple 
choice responses are involved and guessing is permitted, making use of Car- 
roll’s (14a) adjustment for guessing. A convincing argument is presented 
showing that the test scores necessarily have curvilinear relation to underly- 
ing ability. He infers that relations between test scores and ability should be 
expressed in curvilinear form. Lord then shows that if a group of individuals 
all have the same common underlying ability the intercorrelations of the 
items are zero. As a consequence, the curvilinear correlation of true score on 
the normally distributed underlying ability is always plus one; true score 
being defined as the score on an infinitely long test. The curvilinear correla- 
tion of test score on the underlying ability is shown to be equal to the square 
root of the test reliability. Hence, conclusions derived with regard to reli- 
ability can be readily applied in the case of the curvilinear correlation of 
test score on ability. From here he proceeds to the derivation of equations 
and presentation of numerical results. He concludes that ‘‘maximum test 
reliability and the maximum curvilinear correlation of test score on criterion, 
for a test composed of equivalent items, is obtained when the item difficulty 


1 The survey of the literature pertaining to this review was completed in May, 
1953. 
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is somewhat easier than halfway between chance level and 1.00,” and that 
“reliability and curvilinear correlation of test score on criterion decrease as 
the variability of the item difficulties increase.” 

With respect to the conclusion that curvilinear correlation of test score 
on underlying ability decreases as the dispersion of the item difficulties in- 
crease, Lord himself points out that he might not have reached this con- 
clusion had he investigated the curvilinear correlation of criterion on test 
score in lieu of the curvilinear correlation of test score on criterion. In ‘‘A 
Theory of Test Scores’”’ (which will be reviewed shortly) Lord indicates that 
Os+c is likely to be smallest when o;., is largest. The subscript s is the test 
score and the subscript c the underlying ability. It seems, then, very likely 
that the effect of item dispersion on the curvilinear correlation of under- 
lying ability on test score is not necessarily the same as the effect of item dis- 
persion on the curvilinear correlation of test score on ability. While the find- 
ings regarding curvilinear correlation of test score on underlying ability is 
important to a general theory of test scores, the major interest must center 
in findings bearing upon the curvilinear correlation of underlying ability on 
test score. The central problem is in reducing errors of the test when related 
to underlying ability not in errors of underlying ability in predicting test 
score. 

Cronbach & Warrington (18) attack the problem of item difficulty dis- 
tribution in relation to test efficiency by use of a technique which is in part 
the same as one also suggested by Lord (40). The technique will be discussed 
separately since it appears to be a definite contribution to test theory. 

Bivariate surfaces relating scores on various hypothetical tests to true 
ability were prepared by the technique mentioned above. The efficiency of 
the test for a wide range of cutting points (on the test) was then evaluated by 
computing biserial correlations between the test score and true ability, true 
ability being defined as a perfect normally distributed measure of the factor 
common to the test items. The test types investigated included those with 
free response items and, by application of Carroll’s development on guessing, 
tests consisting of multiple choice items in which guessing is permitted. 

Cronbach & Warrington conclude, in general, that tests with all items at 
.5 difficulty have optimal validity for a wide range of cutting scores. The 
range of advantage over tests having spread of item difficulties increases as 
the saturation of the items in the common underlying ability becomes less. 
The author of this review might note here that the data they present do not 
seem to warrant their emphasis on the superiority of the test with peaked 
distribution of item difficulties. With one exception, all of the patterns of 
item difficulty distribution they investigated gave very close to the same 
pig at most cutting points in dealing with sets of items having saturations in 
the common underlying ability typical of most tests. It seemed clear to this 
reviewer that the major conclusion could well have been that, within wide 
ranges, item difficulty distribution makes little difference in the efficiency of 
a test. The exception noted was a test made up of items 60 per cent of which 
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would normally be regarded as extreme (per cent passing below 20 or above 
80). 

Cronbach & Warrington also consider the optimal item difficulties for 
tests when the selection ratio is known in advance. Their results indicate 
that, contrary to general belief, maximum validity of a test having constant 
item difficulty is at a selection ratio such that the per cent selected is higher 
than the average difficulty level of the items‘as” measured: by the per cent 
passing the item. 

In one of the most definitive papers so far published in the area of test 
theory, Lord (40) develops formulae relating the characteristics of test items to 
various characteristics of the test score. The formulae developed include 
those yielding the bivariate surface of test score and ability, the product 
moment correlation between test score and ability, the frequency distribu- 
tion of test scores, the frequency distribution of true score (the score on an 
infinitely long test), the bivariate frequency surface of two tests measuring 
the same ability, the regression of test score on ability, the standard error of 
measurement at a given ability level, and the curvilinear correlation of test 
score on ability. 

In general, these formulae depend upon assumptions that the items of the 
test are dichotomous and that all items are measures of the same underlying 
ability. Normal distribution of ability is assumed for certain of the develop- 
ments but is unnecessary in some instances. Lord restricts consideration to 
items in which guessing is not a factor. 

Expressions for the regression of ability on test score, and for the stand- 
ard error of estimate of ability around test score proved too complex to per- 
mit a practical solution. However, the formula permitting the preparation 
of the bivariate frequency surface of test score and ability allow computa- 
tion of both of these expressions for any given hypothetical test. The devel- 
opment involved here will be discussed in greater detail later. 

Findings expected from theory were compared with those obtained from 
empirical data. In general, agreement appeared to be surprisingly high. The 
theoretical developments appear to offer a sound basis for predicting various 
characteristics of test scores from knowledge of the item parameters. 

From examination of the formulae, Lord concludes: (a) the regression 
of true score on ability is necessarily curvilinear (although with low item in- 
tercorrelations this curvilinearity has a negligible effect); (b) the standard 
error of measurement on a given test is ordinarily least for examinees at an 
ability level at which the test is least discriminating; (c) errors of measure- 
ment for typical test scores have binomial rather than normal distributions 
(while such errors are not correlated with ability they vary in magnitude 
with variation in ability and the skewness of their distribution also varies 
with ability); (¢) maximum discrimination at a given ability level is provided 
by a test, all items of which are passed by 50 per cent of examinees at that 
ability level; (e) unless the item intercorrelations are high, a test consisting 
of items of 50 per cent difficulty will be more discriminating for practically 
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all examinees than will a test having a spread of item difficulties; (f) the shape 
of the test score frequency distribution does not necessarily reflect the shape 
of the frequency distribution of ability. 

The technique discussed by Lord and used by Cronbach & Warrington 
appears to have special importance since it can be applied in attacking vari- 
ous as yet unsolved problems pertaining to the efficiency of a test score as a 
function of the component items or of the test score distribution. This tech- 
nique permits preparation of the full scatter plot showing the relationship 
between a hypothetical test score and the ability common to the test items. 

Basically the technique derives from the point that, within a group of 
individuals having the same underlying ability, the item intercorrelations 
will be zero. Knowing this, it readily follows that the mean, standard devia- 
tion, and, in fact, the full frequency distribution of the scores of a set of indi- 
viduals having the same underlying ability can be obtained for a hypothet- 
ical test by means of a binomial expansion. Since the distribution can be 
obtained for each array, an entire scatter plot showing the relation between 
test score and underlying ability can be prepared. Scatter plots so constructed 
depend upon the assumption that all items have the same difficulty and the 
same saturation in their common underlying ability. However, for a set of 
individuals having the same underlying ability the frequency distributions 
obtained for two sets of items, each having different difficulty levels, are 
statistically independent and may be readily combined by multiplying into 
each other. Thus, an array of a scatter plot can be obtained for tests having 
various difficulty levels and, as a consequence, the entire scatter plot can be 
constructed for such a test. 

Possible applications of the technique have not been exhausted by the 
work of Cronbach & Warrington. A number of writers have expended con- 
siderable effort in developing theories of item difficulty distribution in rela- 
tion to the discriminative power of a test, with attention directed toward 
factors such as the shape of the frequency curve. Many such investigations 
seemed highly speculative. It appears that the basic problem could have 
been more adequately handled by use of the technique we have just consid- 
ered. 

A tie-in of work on test theory with various developments in scaling 
should prove instructive. Lord started his development with consideration 
of an item’s characteristic curve, a construct very similar to those often 
used in scaling work. The basic objectives of scaling and of test construction 
seem to have much in common. Yet a major conclusion of Lord, and of 
Cronbach & Warrington, was that tests with peaked distributions of item 
difficulties are probably best. Such a conclusion seems in opposition to the 
very notion of scaling. 

A final comment on the foregoing developments seems in order. Lord’s 
work seems important in laying the groundwork for a generally acceptable 
theory of test scores. It should have a number of practical applications to 
test construction, particularly when it is desired to specify in advance a test- 
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score frequency distribution, to set up tentative norms, or to estimate in 
advance the proportion of cases in an area of a bivariate surface. It is un- 
likely, however, that direct application will result in validity higher than that 
obtained by following current practices. 

Lord (41) applies the method of maximum likelihood to problems asso- 
ciated with the estimation of underlying ability from the responses to the 
items. His development offers verification of certain conclusions reached 
by different reasoning in his earlier monograph (40). A new development 
which seems important allows estimation of confidence intervals for test 
scores within which the examinee’s true ability may be expected to lie. 


CORRELATION, REGRESSION, AND SELECTION 


The shrinkage formula proposed by Wherry has been commonly used by 
psychologists to obtain an unbiased estimate of a multiple correlation coef- 
ficient. More recently, with widespread discussion of the principle of cross- 
validation, most psychologists dealing with such prediction problems feel 
that experience with empirical data suggests that Wherry’s formula con- 
siderably overestimates the shrunken multiple. Regardless of the adequacy 
of empirical evidence on this point, two papers, neither of which have been 
published, show rather convincingly the source of the difficulty. 

Nicholson (46), and Lord (42) both point out that Wherry’s (or various 
equivalent) formula gives the expected universe value of the multiple cor- 
relation. Although the value of R obtained in the sample used in computing 
the weights is biased upward, this bias is not the only possible source of 
shrinkage. The universe value of the multiple is usually relevant only if the 
universe values of the weights are known. In practice, only estimates of these 
weights are available. Thus, further shrinkage must be expected when a re- 
gression equation is applied to a second sample as a function of the sampling 
error in the regression weights. 

While Wherry’s formula is correct, if the universe value of the multiple 
is desired, another formula is needed to estimate the efficiency of a regression 
equation when it is applied to a second sample from the same universe. Nich- 
olson and Lord both give an expression for the mean squared error of esti- 
mate when a regression equation is applied to a second sample. The formula, 


from Lord, is iii tes 

where £,E,S? is the expected value of the mean squared error of estimate of 
y when a regression equation developed in one sample is applied to a second 
sample, N is the number of cases in the sample used to determine the re- 
gression equation, & is the number of independent variables, and S?,.(z) is 
the mean squared error of estimate in the sample in which the regression 
equation is developed, (x) being the independent variables in vector nota- 
tion. 


Neither of the two authors gave the above formula in this exact form. 
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The formula can be obtained in both instances by simple manipulations. A 
discussion of the differences in their presentation would be unnecessarily 
time consuming. 

The formula given above cannot be simply translated to give an estimate 
of the cross-validity, the correlation between the predicted and actual values 
of y. The reasons for this are discussed by Nicholson and are of sufficient 
interest to justify added comment. It has been known for some time [see, 
for example, Schultz (51a)] that if a regression equation for estimating y is 
derived in one sample and applied in a second sample, all values of y are not 
estimated with the same precision. Precision is greater at the mean value 
and less for both high and low values of y. Consider the bivariate surface 
formed by plotting, in the second sample, the estimates of y derived from 
the first sample regression equation against the actual y values. The esti- 
mates of y would ordinarily be expected to equal the means of the arrays 
of y (in a scatter plot based on the sample in which the regression equation 
was derived). In the bivariate surface in the second sample, the estimates 
of y no longer equal the means of the arrays of y values. The estimates are 
biased and consistently overestimate for high positive values of y and under- 
estimate for values of y below the mean. This bias is of interest if we wish 
to estimate the absolute value of y. The above formula gives the expected 
error of estimate which includes the error involved in this bias. If we are 
interested in predicting only the relative values of y, as we imply in com- 
puting a cross-validity, greater accuracy can be obtained. 

’ Nicholson, in considering aspects of the problem just discussed, decides 
that interest should center in the prediction of the absolute rather than the 
relative values of y. Hence, a formula giving the expected cross-validity 
derived from a regression equation is not provided. This reviewer feels cer- 
tain that such a formula can be obtained by further derivation. Space does 
not permit detailed discussion of this additional problem. 

A further development by Nicholson is pertinent to many present day 
discussions of cross-validation problems. Suppose a regression equation is 
developed in sample one. Suppose, further, that the independent variables 
are known in sample two, but the values of the dependent variable is un- 
known. Nicholson asks whether the estimates of the regression equation of 
sample one can be improved upon, with or without use of the additional in- 
formation given in sample two. He demonstrates that the estimates provided 
by the regression equation of sample one are best in the maximum likelihood 
sense. 

f Flanagan (24) discusses short methods for calculating correlation coeffi- 
| cients that have been widely used for item analysis purposes. He investigates 
| tetrachoric correlations, biserial correlations, the well-known upper and 
lower 27 per cent estimates of product moment correlations, and several 
‘variants of this latter procedure. The variants of the upper and lower 27 
per cent estimates are obtained by double, or in some cases, triple tallies of a 
specified proportion of cases with extreme scores in the criterion. The process 
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is analogous to digiting on IBM tabulators in computing cross-products. 
While the method used for the upper and lower 27 per cent estimate creates 
a three-category variable (values of 1,0, and —1), the variants of this pro- 
cedure approach biserial correlations in that, in effect, a five or seven cate- 
gory variable is created. The computing labor involved with the several 
variants of the upper and lower 27 per cent seems generally less than that 
required in computing a tetrachoric correlation since the data for a substan- 
tial middle group need not be manipulated at all. By any of the short methods 
employed only two values (which might be regarded as weighted percent- 
ages) need be computed. These are used to enter tables previously prepared 
by Flanagan. Computing times are given which indicate that the short 
methods involve about the time required for biserial coefficients. 

To investigate the effectiveness of these various short-cut methods, three 
populations of 10,000 cases was established in which the product moment 
correlations were known to be .00, .45, and .90. One hundred samples of 100 
cases each were drawn from each of the three populations. The coefficients 
were then computed by the methods enumerated above. The mean values 
and empirical estimates of the standard errors of each of the coefficients 
were determined in this way and compared with each other and with the 
standard error of the coefficient, when it could be computed by formula. 

The data support Flanagan’s contention that various of the short-cut 
methods give standard errors considerably less than tetrachoric r and very 
little greater than those of rp;.. The findings seem in close agreement with 
those anticipated by Flanagan through consideration of the variance within 
categories in relation to total variance of the categorized variable. An added 
finding of considerable interest is the close agreement between the standard 
error of the biserial coefficient shown by the empirical findings and the 
standard error given by formula. The adequacy of the formula has, of course, 
often been questioned. Flanagan’s findings suggest that this formula might 
well be used with reasonable confidence in its accuracy. It must be admitted, 
however, that a larger number of samples would have to be used to allow a 
definite conclusion on this point. 

This reviewer agrees with Flanagan’s conclusion that it would seem gener- 
ally desirable to use the shorter methods and, if necessary, plan to obtain 
additional cases to offset the slightly higher standard errors associated with 
such short methods. 

Suppose, in solving a set of normal equations, the full inverse of a 
matrix is computed. This is often done so that the standard errors of the 
regression coefficients can be computed. West (54) develops computing 
methods for resolving the normal equations after deleting, adding, or re- 
placing a variable. The computing methods employ direct operations on the 
inverse matrix itself. 

Hamilton (31) suggests an iterative procedure for computing the inverse 
of a matrix. He reports that considerable time can be saved over that re- 
quired by conventional methods. 
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Doppelt & Bennett (20) investigated empirically the cost of hiring and 
training employees and showed graphically how the use of tests will reduce 
the cost per satisfactory employee. They found generally decreasing costs 
with increase in the cutting score used with the test. With very high scores, 
however, the curve reversed and increases became evident. Such increases 
were anticipated in an earlier theoretical analysis by the author of this 
review (9b). They are explained by the high cost of testing per employee 
hired when, say, only one out of 25 or 50 is selected. 

Arbous & Sichel (5) also discuss this problem. They point out that the 
high cost of hiring an employee, when a small proportion of those tested are 
hired, can be offset by use of short prescreening devices. Mathematical 
formulae, charts, and graphs are developed to establish the effectiveness of 
such prescreening devices. 

Several writers discuss various aspects of the problem of reliability. 
Dudek (21) believes that the term reliability has come to be used too loosely 
and suggests that the type of coefficient be kept in mind when reliability is 
discussed. He points out that inexact usage may lead to incorrect inferences. 
For example, increasing the length of a test will generally yield greater 
reliability as measured by internal consistency coefficients. Such increased 
reliability is not necessarily obtained, however, when test-retest coefficients 
are involved. Lawshe & Nagle (37), using five sets of empirical data, investi- 
gated the practical effect of weighting individual ratings according to the 
reliability of the rater before combining them into a composite. They con- 
clude that the decision to eliminate a low-reliability rater from the rater 
pool, or to apply differential weights to different ratings, has little practical 
effect even when there are wide differences in the reliabilities of the raters. 
Examination of their empirical findings gives strong support to their conclu- 
sions. It was noted that the improvements obtained were, in any event, 
very small. Often elimination of a low-reliability rater resulted in a lower 
reliability for the composite of the ratings. Psychologists attempting to 
improve the reliability of any composite of criterion variables by weighting 
according to reliability might well take note of their findings. Angoff (4) 
considers the problem of determining the reliability of a speeded test. With 
speeded tests it is well known that the usual internal consistency estimates 
are invalid. Angoff proposes a way of determining reliability which will save 
some time over that needed to construct an alternate form. In effect, he 
proposes that a short equivalent of a longer form be used for this purpose. 
Formulae are given for estimating reliability of the speeded test with this 
arrangement. 

Schmid (51) describes the steps in the use of sequential analysis in the 
selection of discriminating test items. Anastasi (2) tried an empirical study 
of sequential analysis as used in item selection. She started with use of ex- 
treme cases and added, in successive steps, the cases closer to the mean. 
She concludes that sequential methods are useful in item selection since 
the results obtained correspond to those that would have been obtained if 
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all cases had been used. It seems likely that her findings could be explained 
on other bases. 

In an article reviewed elsewhere in this chapter, Flanagan (24) gives) 
both theoretical and empirical evidence supporting the idea that short meth- | 
ods of estimating the product moment item validity involving extreme] 
groups are almost as effective as biserial estimates of the product moment} 
validity. His findings may explain Anastasi’s results. Kuang (36) reports’ 
another empirical study of three methods of item selection. He employs 
Davis’s indices, biserial correlation, and probit analysis methods, as means 
of improving the reliability of tests. He finds no significant differences in the 
reliability of the scores for sets of items resulting from the use of the three 
different methods. 

Webb & Jones (53) take note of the controversy between those using the 
“Poisson fit” technique and those using correlational techniques in the 
study of accident proneness. They report uses of the two techniques which 
give very similar results. A method of their own, labelled the binomial cor- 
relation method, is found to give identical results with a ‘‘Poisson fit” tech- 
nique developed by Newbold (45a) and Cobb (14b). Burke (11) shows mathe- 
matically that the two methods are identical in their end effect. 

This reviewer was struck with the similarity of the basic ideas behind the 
accident proneness methods to those used in computing reliability coeffici- 
ents of aptitude tests. The Newbold-Cobb technique involves, essentially, 
a comparison of the obtained variance of the distribution of the number of 
accidents with that to be expected from chance alone. A similar comparison 
is basic to the Kuder-Richardson reliability formulae, where the variance of 
a total score is related to the variance expected if the items have zero inter- 
correlations. Correlational methods used in the study of accident proneness 
have obvious similarity to split-half or odd-even reliability coefficients. 

Fisher (23) gives a readable, although somewhat condensed, discussion 
of some general logics related to the idea of sequential tests. Some points 
he makes regarding the logic of selection, as used by eugenicists to improve 
animal stocks, are pertinent in the present context. Fisher argues that if 
attention is directed to only one dependent variable, the resulting improve- 
ment in that variable may be offset by losses in another dependent variable. 
The applicability of his points to the logic of criterion construction in person- 
nel selection problems is fairly apparent. 

Geary (26) discusses Berkson’s (7a) distinction between controlled and 
uncontrolled measurements of the independent variable. With error of 
measurement in the independent variable, use of uncontrolled measure- 
ments will give biased estimates of the slope of the regression line. In con- 
trolled observation of the independent variable, no bias will result even 
though error of measurement is present in the independent variable. Con- 
trolled measurement might be likened to measurements obtained through 
manipulation or experimentation, while uncontrolled measurement is con- 
cerned with observational data not involving manipulation. 
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Berkson’s conclusions were limited to the case in which the true relation- 
ship is linear. Geary considers the implication of this distinction for the 
nonlinear case and develops a theory of estimation and sampling distribution. 

Gaier & Lee (25) review a variety of approaches to nonadditive or con- 
figural methods of scoring or combining variables and reach rather enthusi- 
astic conclusions concerning their probable value. It should be noted that 
most such approaches when used for prediction can be duplicated or approxi- 
mated by regression techniques involving higher order curvilinearity. In 
view of findings reported in Air Force research during World War II that 
simple culvilinearities noted on samples of 1,000 or more failed to reappear 
in subsequent samples, it seems likely that the promise of configural methods 
will be most evident if analysis is confined to a single sample and cross- 
validation is not attempted. More evidence of the value of such methods with 
adequate cross-validation is certainly needed before wide-spread use can be 
justified. 

PERSONNEL CLASSIFICATION 


Several authors have discussed solutions to the personnel classification 
problem. The classification problem as approached by these authors may be 
stated in the following manner. Suppose a matrix of m columns and N rows 
whose elements, C;;, are measures of the productivities of N individuals in 
each of » jobs. If individuals are assigned to jobs according to specified 
quotas, a sum S can be defined, the elements of which are drawn one from 
each row and so that the total number from each column agrees with the 
number specified by the quota for this job. This sum may be regarded as 
the total productivity of assigned individuals. The objective is to maximize 
S. Two cases should be distinguished. If the total of the quotas equals N, 
complete classification is involved. A second case occurs when the total of 
the quotas is less than N and a portion of the group is to be rejected. 

Lord (43) reformulates and gives a more rigorous proof of a solution pre- 
sented earlier by Brogden (9a). His solution is also stated in more general 
form since Brogden’s development applies to the case in which a portion of 
individuals are rejected entirely. Lord shows that if cases are represented 
spatially, the boundary between the regions for any pair of jobs must be 
defined by some constant value of the difference between the scores on the 
two jobs. There would be 4n(m—1) such boundaries and, consequently, 
3n(n—1) constants applying to an equal number of possible difference 
scores. Each boundary is an (n —1) space. Since all $n(m—1) boundaries pass 
through the same straight line, it is possible to make a translation so that 
they will all pass through the origin. It can be shown to follow then that the 
equations for the 4n(n—1) boundaries can be written with constants, 
one for each estimate of productivity, replacing the 3n(m—1) constants. For 
example, the constant k12 required to define the boundary between jobs one 
and two would be replaced by the difference between v, and v2, where ki: is 
an instance of the 4n(n—1) constants while v; and v2 are instances of the n 
constants. 
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Lord also provides a solution for the case involving a rejection group 
which is effectively the same as that given by Brogden. 

The integrals from which, in theory, the 2 constants could be determined 
for any set of quotas are written out, but Lord decides that the expressions 
thus obtained are too cumbersome to be of practical use. 

Votaw (52) applies the simplex method used in solving the Hitchcock- 
Koopman transportation problem to the personnel classification problem. 
It will be convenient to take some liberties with Votaw’s statement of the 
optimal solution to allow a more direct comparison with that of Lord or 
Rao (49). 

In effect, Votaw indicates that an optimal solution is obtained if the 
proper number of individuals is placed in each job and if for any individual i 


Cik + 1% + ui; 2 Cig +; + 4 


where & is the job to which the individual is assigned and j is any of the re- 
maining jobs. The v’s in the above equation were defined earlier while the 
u's are constants applying to individuals. The reader will notice that 1; is 
repeated on both sides of the equation and is, consequently, unnecessary 
for definition of the optimal solution. The w’s are necessary, however, in 
solving sets of equations used in the simplex method. 

To solve the problem encountered when a portion of the population are 
to be rejected, a dummy job is set up with the productivity estimates of all 
individuals in this job being zero. It can be seen that individuals placed in 
this dummy job have zero contribution to S. Since their contribution to S 
would be zero if they were rejected, the use of a dummy job in this way 
gives results equivalent to those obtained with a more direct approach. 
This rather clever device permits use of the simplex method in the same way 
as it is used when no cases are to be rejected. Votaw also considers a number 
of additional special problems and shows how adaptations of the simplex 
method can be used to solve them. For example, the method can be used to 
minimize the cost of training. 

In general, the methods proposed by Votaw have been rigorously de- 
veloped and include solutions to a number of related or subsidiary problems. 
An iterative procedure for accomplishing the actual allocations of men to 
jobs is given as an integral part of the mathematical development. Provisions 
are made to avoid degenerate problems. The principal difficulty seems to 
be in the labor required to accomplish the solution. Since the solution 
becomes lengthy if the number of individuals or jobs is large, Votaw prefers 
to group them in classes. It is possibly an open question whether the use of a 
small number of classes will materially affect the efficiency of the final allo- 
cation of men to jobs. 

Rao (49) proves a lemma which, in effect, provides a solution to the classi- 
fication problem. He first gives the proof for a solution needed when it is 
desired to classify so as to maximize the product rather than the sum of the 
scores of classified individuals. While his proof is compact and may be of 
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interest for that reason, it will not be discussed in detail at this point since 
the end solution is essentially the same as Lord’s. 

Rao’s lemma may be stated as follows. If there exist quantities, v;, - + +, 
v, such that each element, Cx, of the Q; elements chosen from the k’th 
column, satisfies the relationship 


Cir + 1 2 Ci5 + 0; 


and if similar relationships held for elements similarly chosen from any 
other column using the same set 2;, - - - , ¥n, then for this choice of Q,,---, 
Qn elements the sum S is a maximum. Q), - +: , Q, are the quotas for the n 
jobs. The remaining symbols were defined earlier by this reviewer. 

Some general comments on the several solutions to the classification 
problem so far considered should be helpful. First, it should be noted that 
the end solutions achieved are all effectively the same. While it might 
appear that Votaw’s u,’s introduce a real difference, we have already indi- 
cated that the u;’s are pertinent to the iterative procedure and have no 
bearing upon the final solution. In fact, any set of u;’s may be added to vari- 
ous individual’s scores without affecting the final outcome, except where a 
reject group is involved. 

One basic difference in the approach of Votaw and that of Lord and Brog- 
den should be made explicit. The simplex method of Votaw starts with any 
allocation of men to jobs that satisfies the specified quotas. He then solves 
a set of equations and interchanges men between two jobs in a way that will 
give maximum improvement in over-all productivity. Each of the successive 
steps in reaching the solution improves the sum total productivity. By con- 
trast, the iterative method proposed earlier by Brodgen yields a maximum 
allocation sum at each stage. An approximate fit to quotas is taken as a 
starting point with each successive step aimed at an improved approxima- 
tion to the quotas. The latter type of iterative solution seems in general 
much faster and more adaptable to use with large N’s. The iterations are 
based on subjective considerations, however, while Votaw’s procedure boasts 
objective mathematical bases for each iteration. The advantages of the latter 
may be important if it is desired to use modern high speed computers. 

Applications of the classification theory can, in some instances, be quite 
simple and straightforward although the bases for such application were not 
made explicit in any of the papers just discussed. Suppose a set of v,;’s have 
been determined for specified quotas using a large sample. If the samples 
were sufficiently large, operating use of the findings in a second large sample 
could be as follows. First, 1, is added to all of the productivity scores for 
job 1, v2 to all of the productivity scores for job 2, etc., for all jobs. Classifica- 
tion could then be accomplished by assigning each individual to the job in 
which he received his highest score. Thus, if the v;’s could be determined 
beforehand, in some way, application is extremely simple. The basis for such 
application is readily seen by relating the steps involved to the equations 
or solutions previously given. 
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Two as yet unpublished papers on the classification problem which have 
come to this reviewer's attention will be briefly discusseed. Paul Dwyer gives 
a clear and readable account of theory and practice relating to classification 
problems. He develops new proofs and proposes a method of ‘‘optimal re- 
gions’”’ which permits a more rapid solution for the v,’s. Paul Horst proves, 
for the two-job case, that if estimates of productivity are used as the 
C;;'s, these estimates are ‘“‘best” if determined by the conventional least 
squares procedures. 

To conclude, there seems little question but that the nature of optimal 
solution to the personnel classification problem is well established. Various 
iterative procedures for arriving at this solution are known, and improve- 
ments on these will probably be forthcoming. It is possibly unnecessary to 
add that a host of related problems still remain to be solved. 


Factor ANALYSIS 


Burt (13) reviews at length the literature on significance tests related to 
factor analysis. He concludes that no really satisfactory solutions are avail- 
able. The more rigorous tests are too elaborate and are still not fully rigorous, 
while the more practical tests are based on seemingly untenable assumptions. 

The main theoretical difficulties arise from (a) the fact that the errors of 
the observed intercorrelations are themselves correlated, and (b) multi- 
variate analysis is not equipped to deal with the assumptions of specific 
factors. The only apparent exception to Burt’s general conclusion relates to 
the principal component factor solution where exact tests are available, 
permitting one to decide when to accept factors as established. He concludes 
that no very rigorous solutions are available when centroid or group centroid 
procedures are used in factoring. 

In treating the problem of when to stop factoring, Burt again feels that 
none of the solutions so far pr: posed are very useful. He states his own prefer- 
ences in the matter which involve extraction of several centroid factors be- 
yond the point where the loadings ‘“‘appear’’ to be significant. 

Guttman (29) discusses two theorems that had been presented in an 
earlier paper of his, and considers their relation to group factoring methods 
and to the 1actoring of score (as opposed to correlation) matrices. He also 
considers the “‘inverted’’ factor methods in relation to the factoring of score 
matrices. 

His first theorem is contained in the matrix equation 


G, = G — GX'(XGX') XG 


where G, is the residual matrix, G is the correlation matrix to be factored 
and X is a matrix of weights of the order s by n, s being the number of factors 
to be extracted and nm the number of variables. The matrix X may take vari- 
ous forms according to the methods of factoring actually used. In its simplest 
form the weight matrix could consist of 0’s and i s and could represent the 
experimenter’s identification of variables used to define the group centroids 
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for the s factors to be extracted. Proper definition of X can, in theory, 
eliminate the need for rotation. 
His second theorem is contained in the matrix equation 


S, = S — SY'(XSY’)7XS 


where S; is the residual score matrix, S the original score matrix and Y a 
matrix of weights of the order s by m, s being the number of factors to be 
extracted and m the number of cases. X is defined as in his first theorem. 
This theorem implies that factors can be extracted directly from the test 
scores. Guttman discusses this problem in detail and relates his discussion to 
other proposals to accomplish this end result. The point is made that the 
choice of Y in no way affects ‘“‘the common factor space of tests as studied 
by X.” Similarly, the choice of X in no way affects the results from a given Y. 

It seems clear that Guttman’s earlier article contains some definite contri- 
butions that have often been overlooked in subsequent discussions of similar 
problems. It also seems clear that the matrix equations given, plus the 
accompanying more detailed discussion, are not equivalent to a fully de- 
veloped method for factoring a matrix of intercorrelations. 

A point of Guttman’s on the general logic of experimental designs in 
factor analysis will bear emphasis. Guttman suggests that hypctheses con- 
cerning factor structure can be explicitly stated in the weight matrix X and 
the analysis then used as a check on the initial hypothesis. By conducting 
analyses in this fashion, the possibility of capitalizing on chance error can 
be avoided. His pvint seems cogent in view of the meager amount of informa- 
tion available on sampling errors of factor loadings together with the exten- 
sive bias and shrinkage found in prediction studies involving cross-validation 
designs. It must be realized, on the other hand, that statement of hypotheses 
with sufficient precision to make Guttman’s approach a fruitful one is, 
possibly, easier said than done. 

Carroll (14) proposes that a new matrix rt be formed which contains the 
squares of the entries in the “‘simple structure” matrix V, and that the major 
requirements of simple structure will be satisfied by minimizing the non- 
diagonal entries of r’r. 

Carroll’s formulation provides a unique solution. However, he indicates 
that the solution providec is an approximation to simple structure, and that 
the computations are too :11 olved for most applications. Adaptation to high 
speed computers may allow widespread application. The solution is an 
approximation since the presence of complex tests makes the primary axes 
more highly correlated than would be the case if they were located by 
graphic rotations. 

When tests and people are located in the same factor space, a test having 
the same direction cosines may be called the test-equivalent of that person. 
Sandler (50) suggests that such test equivalents of persons will aid in rotating 
and interpreting factors. He provides the necessary equations for identifying 
them. 
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Adcock (1) discusses Thurstone’s multiple group method of factoring 
and suggests that improvement would obtain if an alternative method were 
available for rotating the oblique group centroids to orthogonality. Such an 
alternate method should keep the resulting orthogonal reference axes as close 
as possible to the positions suggested by all the clusters and thus reduce the 
labor of rotation. A precise solution attributed to Ledyard Tucker is given 
together with an approximate solution which reduces the computations in- 
volved. 

It is of interest to note that principal component solutions are being 
obtained on high speed computers in less than an hour’s time. Wrigley & 
Neuhaus (55) report on the use of such computing methods. D. J. Wheeler, 
R. T. Gregory, and R. F. Klippinger are credited with developing the pro- 
grams on the ORDVAC electronic computer. The program can, presumably, 
be made available to others. 


ANALYSIS OF VARIANCE AND RELATED DESIGNS 


The statistical literature on analysis of variance is extensive and special- 
ized. Developments with regard to both design and methods of analysis have 
far outstripped the practices of those who use them in psychology. It seemed 
wise, as a consequence, to avoid attempting full coverage of new develop- 
ments and seek papers that emphasize the problems of design and analysis 
pertinent to current usage in this area. 

Kogan (35) gives a clear and well-organized discussion of the use of 
variance designs in psychological research. While not all of the points he 
makes are new, many will bear repetition. Instances are cited in which the 
investigator fails to use methods of analysis which are appropriate to the 
design which he sets up. For example, equality of means and standard devi- 
ation may have been forced, but this feature of the design was not taken into 
account in the analysis of results. 

Kogan criticizes the use of ¢ tests in order to check on the significance) 
of the difference between various pairs of means. He points out that such ¢/ 
tests are not independent, and cites Cochran & Cox (15b) who show that} 
apparent significance will be found with much greater frequency than would| 
be indicated by the chosen level of significance. It seems that the problems! 
involved in the use of a series of ¢ tests recur frequently in various types of! 
statistical applications. 

Kogan stresses, as others have done before, the practical advantages of 
“confounding” in the planning of experiments. Repeated findings of insignifi- 
cant interactions should be more often utilized by investigators as the basis 
for a confounded design. The basic idea is to improve precision of selected 
comparisons while neglecting others which prior investigations have shown 
to be unimportant. He points out that experiments involving repetitive 
measures of criterion variable for one or more groups of subjects characterize 
many psychological researches but do not appear in other research areas 
for which analysis of variance designs have been developed. He regards 
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methods in current use by psychologists as somewhat questionable, pointing 
out that, in general, successive measures takes on the same subjects should 
not be regarded as either randomly distributed or independent. There seems 
to be need for further theoretical work on the topic of repeated measure- 
ments, preferably with the aid of mathematical statisticians. 

Kogan notes the undesirability of designs which allow all subjects to 
undergo several different experimental treatments in the same order. Such 
a design is increased in sensitivity by the separation of individual variations 
in regression on time from the estimate of error, but this does not overcome 
the confounding of differences in ‘‘set’’ means with possible temporal effects 
of fatigue, adaptation, etc. Such confounding of the main experimental fac- 
tor should be avoided by randomizing the sequence of experimental condi- 
tions among the subjects. 

Kogan notes that most psychologists dealing with higher order multiple 
classification or factorial designs proceed from the bottom up in setting up 
F ratios from an analysis of variance table. That is to say, successive inter- 
actions are tested starting with the higher order ones and, if all are found 
to be insignificant, a denominator for the F ratio is used which consists of the 
pooled interactions and residual terms. This procedure is criticized as having 
no sound basis. He refers to Cochran (15a) for a discussion of the problems 
involved and a solution that will often be applicable. In some further dis- 
cussion of individual tests of significance in connection with analysis of vari- 
ance, Kogan suggests that procedures presented by Tukey (51b) are the most 
simple and practical. Tukey’s procedure involves use of a ‘“‘gap”’ test, a 
“straggler’’ test, and a new F test to subgroups among the treatment means 
after a significant F has been found for a set of treatment means. Kogan 
points out that psychologists have given consideration to the Neyman- 
Pearson Type I errors and have generally omitted consideration of the pos- 
sibility of Type II errors. While most investigators appreciate that failure to 
reject the null hypothesis does not prove the null hypothesis, there is a strong 
tendency to draw conclusions in a fashion that suggests that the null hypothe- 
sis is proven. He suggests that the power function of the analysis of variance 
test could be useful for estimating the sample sizes needed to reduce the 
probability of a Type II error to some specified level. Consideration of the 
power concept in relation to current psychological research leads to the con- 
clusion that many experiments are carried out with a design such that there 
is too high a probability that findings of practical significance will fail to show 
statistical significance. 

Archer (6) describes greco-latin designs adapted for use in learning stud- 
ies. Since, in learning studies, the subject rarely learns the same content 
under all conditions, but the content must be presented in the same order 
for all subjects, a simple latin square is inappropriate. Order of presentation 
would be confounded with a practice effect to be described. To avoid such 
confounding, order of, say, a list of words can be presented according to a 
“latin” square independent of the latin square for conditions, or in other 
words, a greco-latin square can be used. 
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Practice effects of the usual sort can be handled by designs previously 
developed. The design proposed by Archer deals with practice effects that 
generalize from the learning of, say, one list of words to the learning of a sec- 
ond series of words. With such practice effects the several conditions of 
learning under investigation would not be experimentally independent. 
Two basic designs plus a variant of one of these designs are described. The 
chief limitation noted is the loss of the possibility of evaluating higher order 
interactions. 

Apart from Archer’s paper there is a considerable body of literature 
appearing ir the Annals of Mathematical Statistics, Biometrika, and other 
similar journals which set forth designs, or sets of designs, for a considerable 
variety of problems. There is no attempt at thorough coverage of such papers. 
A brief discussion of certain of the more pertinent papers is given below.. 

Patterson (47) is concerned with construction of the type of experimental 
designs in which treatments are applied to the experimental material in a 
number of successive periods, each experimental unit receiving a different 
treatment in each period. Elements of balance required for simple statistical 
treatment are set out. 

When partially balanced incomplete block designs were first developed, 
there was concentrated attention on designs involving two or three accuracies 
in the treatment comparisons. Nair (45) deals with designs involving four 
accuracies, and the necessary formulae are worked out in detail. Illustrations 
are given. 

An experimental design in which there is no randomization by the experi- 
menter, except possibly in naming the treatments, is called systematic. 
Cox (17) reviews recent work on systematic designs for use when the residual 
variation is (a) auto-correlated, and (b) formed from a trend plus random 
error. Cox points out that systematic designs have been considered ques- 
tionable following criticisms by R. A. Fisher. He adds that randomized de- 
signs (as opposed to systematic designs) always give unbiased estimates of 
error aud that this is a completely convincing argument for randomizing in 
all cases except those in which we have special reason to expect a system- 
atic design to be either more accurate or more convenient. 

Kempthorne (34) discusses a class of designs using blocks of two plots. 
Bose & Connor (8) consider the combinatorial properties of group divisible, 
incomplete block designs. Connor (16) discusses some relations among the 
blocks of symmetrical group divisible designs, while Bose & Shimamoto (9) 
give a long detailed discussion of partially balanced incomplete block de- 
signs involving two associate classes. 


TESTS OF SIGNIFICANCE 


Cochran (15) presents a readable account of the history of x? test es, 
discussion of a number of the more important papers involved. His sub- 
sequent discussion of the practical use of the test will be of interest to psy- 
chologists. It is customary to recommend in applications of x? that the| 
smallest expected number in any class should not fall below 5 or 10 cases. 
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Cochran points out that, while use of small frequencies is undesirable, the 
exact choice of numbers was originally arbitrary, and that subsequent in- 
vestigations have been too few or too narrow in scope to allow definite con- 
clusions. He offers recommendations, hdwever, for a number of separate cases. 
His recommendations are based on the viewpoint that disturbances in P at 
the .05 level may be permitted to range from .04 to .06, while those at the 
.01 level may be permitted to range from .007 to .015. 

In testing for goodness of fit for bell-shaped curves such as the normal dis- 
tribution, problems arise because expectations at the tails are quite small. 
' However, little disturbance in x? at the 5 per cent level occurs even with a 
' single expectation as low as %, or 2 expectations as low as 1. This conclusion 

holds at the 1 per cent level if the number of degrees of freedom in x? ex- 
exceeds 6. Since important discrepancies will most likely occur at the tails 
in applying the test to bell-shaped curves, Cochran suggests using cells with 
as small expectations as possible without introducing disturbance. 

If all expectancies involved in a x? test are small, expectancies as low as 2 
may be used without appreciable loss of accuracy. For 2X2 tables Cochran 
indicates that Fisher’s exact test should be used if N<20, or if N is between 
20 and 40 and there are expectations below 5. If continuous data are in- 
volved large expectations should be avoided. Expectations should be kept 
down to 12 per cell for N=200, 20 per cell for N=400, and 30 per cell for 
N=1,000. 

A number of alternate and supplementary tests are discussed. A suple- 
mentary test of some interest deals with the case when systematic departure 
from theoretical distributions seem evident. The number of runs (consecu- 
tive departures from the theoretical distribution in the same direction) is the 
basis for this test. The distribution of the number of runs has been shown to 
be independent of x? and may, consequently, be used in addition to this test. 

Brozek & Tiede (10) consider the problem often faced after completion 
of an item analysis. If 5 per cent of the items are significant at the 5 per cent 
level, the statistical significance of the 5 per cent with high critical ratios is 
obviously suspect. If, in general, N significance tests have been performed 
and » show significance at some given level, how do we know whether the n 
occurrences could be attributable to chance? The probability of m significant 
results at the .05 level in N tests can be obtained by expanding the binomial 
(.05+.95)% and summing an appropriate number of terms. The computa- 
tions involved are prohibitive if N is large. However, with a large N the 
authors indicate that the binomial distribution can be approximated by the 
normal distribution with the mean equalling NP and the S.D. equalling 
NPQ. The P’s for the set of items are assumed to be statistically independ- 
ent of each other. 

One difficulty with their solution derives from the fact that the sampling 
errors in a set of item coefficients are themselves correlated (to approximately 





YUM 


viMnw 


STATISTICAL THEORY AND RESEARCH DESIGN 395 


the same degree as the items are correlated). Consequently, the assumption 
of independence of the probabilities may not always be appropriate in ap- 
plications to item analysis problems. 

Baker (7) takes various writers to task for failing to distinguish between 
joint and compound probability, when probabilities are combined from two 
samples in item selection. Compound probability is proper although, since 
compound probability is a monotonic function of joint probability, the same 
items will be selected in the end. An abac is presented for easy computation 
of compound probability. 

Gordon, Loveland & Cureton (28) contribute to the solution of the same 
problem. They present extensions of tables for x? for two degrees of freedom. 
Such tables can be used to combine the probabilities of rejecting the null 
hypothesis, when the latter is determined in two separate samples. 

Moses (44) considers the problem of testing for significance when ex- 
perimental manipulation is believed likely to produce both high and low 
scores without, possibly, having effect on the mean. He proposes that the 
significance of the difference between an experimental and control group 
can be shown by a test which he develops. After all subjects, experimentals, 
and controls are ranked in order a statistic s,* is to be computed. The least 
number of consecutive ranks necessary to include all the members of the 
experimental group except for the h least and the h greatest of them is 5,*. 
Significant values of s,* are tables for a few situations, and a formula is avail- 
able for direct computation. 

Paulson (48) investigates problems arising in the identification of the 
best treatment or control category when a number of experimental (treat- 
ment) categories are compared with a control. Specifically he seeks to devise 
a statistical test so that if none of the categories is superior to the control the 
probability that it will be selected will be 21—a. Paulson also asks the size 
of the sample needed so that, if one of the categories is really superior to all 
others by a given amount, the probability will be 21—8 that this experi- 
mental category will be selected as best. 

Solutions, both exact and approximative, are developed both for the 
case when (a) the observations are normally distributed, and (b) the ob- 
servations have a binomial distribution. Even the approximative solution 
leads to fairly involved equations and requires, seemingly, considerable com- 
putation. The problem may sometimes be important enough, however, to 
warrant added effort needed to apply Paulson’s solution. 

Suppose in a group of N individuals, each chooses d possible associates. 
Individuals not chosen are isolates and, if N and d are fixed, the number of 
isolates has often been regarded as an important characteristic of the social 
group. If it is desired to say whether the number is large or small the chance 
distribution of this number must be known. The expected number of isolates 
was previously given as 
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mean = N[(N — d — 1)/(N — 1)}9—. 


Katz (32) determines an exact expression for the distribution of a number of 
isolates and then proceeds to find an approximation that will reduce the 
computations. The variance of the distribution is 


variance= W (1-="— i. [tw-1- ~é) (4 N-2 = (sf N- + 


The nature of the distribution obtained may be judged roughly from the 
statement that a modified Poisson distribution gives a good fit. 

Burke (12) makes the very cogent point that arguments concerning the 
nature of scales are irrelevant to statistical treatments, either descriptive 
or inferential. In using statistical tests we merely wish to be in a position to 
say that we do or do not have confidence that, with repetitions, essentially 
the same results will be obtained. Stated otherwise, if we are willing to as- 
sume a universe of biased or otherwise inadequate measurements, then sam- 
ples from this universe will behave according to the same statistical prin- 
ciples that govern the behavior of samples drawn from universes of unbiased 
or more adequate measures. 


TRUNCATED DISTRIBUTIONS 


Suppose records are available giving the number of accidents per person 
for those having accidents but the number of persons having no accidents 
is unknown. David & Johnson (19) discuss various problems of estimation 
involved in such a situation and give a solution. Tables are given to aid in 
certain of the computations. Halperin (30) develops solutions to a number of 
similar truncation problems involving different frequency distributions. 


SCALING 


Edwards & Thurstone (22) describe a modification of the method of equal- 
appearing intervals together with an internal consistency check. The basic 
data are sortings by a number of judges of a number of stimuli into an ar- 
bitrary set of scale categories. The frequencies for each of the stimuli are 
expressed as proportions and used, assuming normal distribution of judg- 
ments, to estimate the location of the boundaries of the scale categories 
in relation to each of the stimuli. By subtraction several estimates of the 
widths of the boundary of each category (except the two extreme categories) 
are obtained. The average is taken to estimate the width of each boundary. 
A new scale is thus obtained and the stimuli are placed on the scale accord- 
ing to median value of distribution of judgments pertaining to each stimulus. 

Using the scale values of the stimuli and the widths of the categories, 
theoretical frequency distributions (expressed as proportions) can be deter- 
mined for each stimulus and compared with observed frequencies. Such a 
comparison was made for a particular set of data with a resulting average 
discrepancy of .025. This average discrepancy compares favorably with simi- 
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lar values reported in the literature when internal consistency checks were 
applied via the method of paired comparisons. Paired comparisons were ob- 
tained on the same set of empirical data. The correlation between the values 
obtained by the two procedures was nearly perfect. The evidence presented 
indicates rather conclusively that the two methods yield equivalent results. 

Katz (33) criticizes status indexes in current use and proposes a new 
index that takes into account the status of the persons providing the basic 
data. 

Gibson (27) shows the existence of a least squares solution for case IV 
of the law of comparative judgment. Because of the heavy computations 
involved, he does not propose that his solution be used in paractice. 


GENERAL 


Few textbooks on statistical methods of general value to psychologists 
were published during the period of this review. Lacey’s Statistical Methods 
in Experimentation (38) seems too limited in coverage to be suitable. Ander- 
son & Bancroft’s Statistical Theory in Research (3), and Rao’s Advanced 
Statistical Methods in Biometric Research (49) would be found somewhat 
heavy. Rao’s book, however, should prove useful to many psychologists. 
His verbal statements of statistical problems and use of examples are es- 


pecially helpful. A number of applications to psychological problems were 
noted. 
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COMMUNICATION! 


By GreorcE A. MILLER 
Massachusetts Institute of Technology, Cambridge, Massachusetts 


Communication presents many aspects worthy of scientific scrutiny, 
but only a few have been selected for this review. The necessary selection 
was achieved by rigid adherence to a single rule: Only those studies were 
covered that interested the reviewer. Anyone who resents the omission of his 
work should know that personal appeal, not merit, was the reviewer's yard- 
stick. 

The review begins with two linguists, Jakobson and Harris, who have 
recently published summaries of different theoretical positions. For a broader 
view of the many activities in linguistics that are related to psychology, the 
reader is referred to the report of a recent conference of anthropologists and 
linguists (19). 

ROMAN JAKOBSON 


Both psychologists and linguists are apt to be impressed by Preliminaries 
to Speech Analysis (13), but for different reasons. Psychologists will be im- 
pressed by the forceful exposition of the distinctive features and the way 
these crucial dimensions of speech are related at the acoustic, physiological, 
and psychological levels of analysis. Linguists have heard most of that story 
before, however, and they are more interested in technical points, such as 
how /p/?, /t/, and /k/ should be handled in a system of binary oppositions, 
or whether ‘‘minimum redundancy”’ is a satisfactory criterion for a unique 
phonemic solution of a given language. 

Jakobson, who is probably the outstanding linguistic theorist alive today, 
has devoted much of his theorizing to the distinctive feature. The linguist’s 
“distinctive feature’ is almost (but not quite) synonomous with the psy- 
chologist’s ‘‘discriminative stimulus.” The spoken utterances in any language 
differ among themselves in many ways, but reinforcement is contingent upon 
just a few of those differences. Those aspects of the stimulus which must be 
discriminated in order to understand a language are the distinctive features, 
and all other features are nondistinctive. For example, the utterances “‘pill’’ 
and “‘bill’’ are heard as different by people who speak English, so these peo- 
ple must discriminate voiceless /p/ from voiced /b/, or tense /p/ from lax 
/b/. Unless they respond with respect to the significant dimensions of the 
stimulus, they will not be able to communicate. A listener who ignores the 
dimensions of voicing and tenseness, but discriminates on the basis of pitch, 
will not prosper in an English-speaking community. 


1 The survey of the literature pertaining to this review was completed in May, 
1953. 
2 Phonemic symbols are indicated by placing them inside slanted lines. 
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The idea of distinctive features has been familiar to linguists for several 
years. What Jakobson has done is to shift the emphasis, i.e., to make dis- 
tinctive features, rather than phonemes, the ultimate entities of linguistic 
analysis. This shift in emphasis has some interesting consequences. One of 
them is to show that when a linguist defines a phoneme, he is reducing to a 
minimum the number of phonetic dimensions that must be used to describe 
the language. This problem is formally similar to a factor analysis problem. 
Another consequence of shifting from phonemes to features is that the rela- 
tion to psychological theories of discrimination learning becomes more ob- 
vious and direct. 

The changed emphasis also leads at once to the observation that most 
features, and perhaps all, are discriminated dichotomously. Jakobson has 
exploited the dichotomous nature of the features in his phonemic notation 
by writing ‘‘-+-” if the phoneme has one value of a feature and ‘‘—”’ if it has 
the other value. Cybernetics fans, already at home with binary logic, binary 
relays, binary neurons, binary digits to measure information, etc., are in- 
trigued by Jakobson’s binary notation for phonemes. The binary notation 
aligns speech firmly on the side of the digital computer as opposed to the 
analogue computer, as discrete rather than continuous. This discrete de- 
scription contrasts with the kind of continuous description in acoustic terms 
that one expects to get from, say, the Bell Telephone Laboratories (4, 5, 15, 
29, 30). 

The difference between a linguist’s and an engineer’s approach to the 
sounds of speech is interesting. The engineer possesses powerful analytic 
methods that enable him to distinguish quite precisely among the sounds of 
speech, or among any other sounds whatsoever. He can describe exactly the 
differences among ‘“‘ah” and ‘‘oo”’ and a clarinet tone and a frying egg. His 
trouble is that he can describe too accurately. Many of the differences he 
measures are irrelevant. He must learn to ignore many of the descriptive data 
that his analyzing machines are able to report. Therefore, he sees the problem 
as one of abstraction, as a matter of ignoring things until some essential 
property emerges as the invariant. The linguist, on the other hand, has cruder 
instruments, his ears and his memory. He can distinguish perceptually 
among sounds, and he can remember the contexts in which similar sounds 
occur. Therefore, he sees the problem as one of patterning, as a matter of dis- 
covering the structure that relates the elements. 

The engineer’s description is absolute, in terms of distance, mass, and 
time. The linguist’s description is relative, in terms of perceptual dimensions 
and contextual constraints. The linguist knows that the same physical pat- 
tern can be perceived as phoneme A in one context and as phoneme B in an- 
other context. 

A clear experimental demonstration of the relativity of perceptual judg- 
ments of speech is now available in the work of Liberman, Delattre & Cooper 
(17). They put the same acoustic patterns in front of all the different vowel 
sounds. By this systematic procedure they were able to discover, for example, 
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a consonant-like sound that listeners called ‘‘p” in front of ‘‘oo” or ‘‘ee,”’ 
but called “k’’ in front of ‘‘ah.”” The same stimulus is phoneme ‘‘p’”’ in one 
context and phoneme ‘“‘k”’ in another context. The engineer, by concentrating 
on how physically different sounds can be judged as identical, has tended to 
overlook the disturbing fact that physically identical sounds can be judged 
as different. The linguist is better prepared to let the particular stimulus be 
judged in terms of its position in a larger linguistic Gestalt. Pike (31) has 
recently described the concept of linguistic relativity in some detail. This 
relativity will make the construction of speech-recognizing machines a diffi- 
cult engineering task (4, 15). However, Licklider (18) has offered some inter- 
esting suggestions as to how a nervous system might make use of known en- 
gineering techniques—filtering and correlating—to achieve the results it does. 

In the Preliminaries to Speech Analysis Jakobson, Fant, and Halle have 
undertaken a tentative catalog of all the distinctive features. These include 
grave versus acute, compact versus diffuse, strident versus mellow, nasal 
versus oral, tense versus lax, sharp versus plain, flat versus plain, voiced 
versus unvoiced, checked versus unchecked, interrupted versus continuant, 
vocalic versus nonvocalic, and consonantal versus nonconsonantal. For each 
feature the reader is told what acoustic properties of the stimulus are involved, 
how those properties are produced by the vocal apparatus, how the feature 
is perceived, and in what languages the feature is known. At many points in 
the survey the evidence is fragmentary. Probably the details will be revised. 
But the broad outline into which these details must fit is clearly prescribed. 

An important consequence of the emphasis on features rather than pho- 
nemes is that it encourages the linguist to accept the psychological nature 
of many linguistic problems. A willingness to use psychological techniques 
distinguishes Jakobson from Harris. The difference is worthy of elaboration. 


ZELLIG S. HARRIS 


In his Methods in Structural Linguistics (9) Harris describes a rather dif- 
ferent kind of analysis that emphasizes ‘“‘distributional techniques.” The 
“distribution”’ of a particular linguistic element is the set of all contexts 
which will become utterances in the language when that element is substi- 
tuted into them. In order to give the flavor of the difference between Harris 
and Jakobson, a fictitious example is useful. 

An exotic language has three kinds of phonetic segments (physiological 
coordinations or acoustic spectra) that are designated A, B, and C. People 
who speak this language have been heard to say only BA, AC, and AA. 
Examination of these utterances (the ‘‘corpus’’) leads to the discovery that 
B and C never occur in exactly the same context. They have ‘“‘complementary 
distributions’: B occurs only before A and C occurs only after A. When this 
situation arises, it is unnecessary (or “‘redundant’’) to retain a distinction 
between the segments so distributed. It is possible to call both B and C by 
the same name, say D, so that the three utterances become DA, AD, and AA, 
respectively. The two segments B and C, although phonetically distinguish- 
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able, have now been classed together as variant forms (‘‘allophones”’) of the 
same phoneme, C. (If A and B or A and C are combined, however, confusions 
will result because their distributions are not complementary.) Notice that 
the principle of complementary distribution makes no reference to meaning 
or to any nonlinguistic evidence. In fact, nothing is said even about the sound 
of the three segments. 

Harris advocates the use of distributional methods to discover the basic 
units, or phonemes, of any language. (Similar techniques are also advocated 
for isolating morphemes, but that half of Methods in Structural Linguistics 
will not be considered here.) The method can squeeze out much of the re- 
dundancy of a phonetic transcription without reference to phonetics or 
semantics. It has the disadvantage that the solution is not unique and that a 
subsidiary principle of ‘“‘phonetic similarity” is usually required to rescue the 
solution from minor absurdities. And the method takes for granted the prior 
existence of an accurate phonetic transcript, a document that could not exist 
without the prior segmentation of the utterances into discrete temporal units. 

For comparison, consider Jakobson’s approach. First, construct sets of 
utterances that are identical except for minimal segments: BA and AA are 
identical except for their initial segments. The existence of these sets of ut- 
terances is proof that the language contains an initial opposition between B 
and A and a final opposition between C and A. The fact that BA and AA 
produce different behaviors from these people indicates that B and A must 
be distinguishable (and similarly for C and A). Second, examine the phonetic 
nature of the oppositions. If any of the phonetic features are common to B 
and C but not to A, so that a single feature can distinguish both B from A 
and C from A, then B and C can be equated in a single phoneme, D. 

Thus Jakobson and Harris reach the same result, but by quite different 
routes. Jakobson stresses the importance of the initial segmentation (which 
probably requires a knowledge of the meanings of the language) and the 
phonetic features of the utterances, while taking distribution for granted. 
Harris emphasizes distributional analysis, and takes the initial segmentation 
and the phonetic similarities more or less for granted. 

In his attempt to purify his methodology, Harris would like to eliminate 
the intuitive, psychological concept of meaning from linguistics. The meaning 
can give useful hints that help the analyst reach a solution more rapidly, 
but for complete rigor the same solution should be attained by considering 
the distributions in very large contexts. On the other hand, Jakobson might 
proceed without knowing the meanings, but he has little desire to try. Wheth- 
er or not a linguist can work without meanings, his job is certainly simpler 
if he knows them. In any event, the kind of meanings that the linguist needs 
can be defined with considerable objectivity in terms of behavioral discrim- 
inations. No appeals need be made to subjective intangibles in order to use 
meaning in a linguistic analysis. However, the spirit of Harris’ work is strong 
in many American linguists, and the Methods summarizes a self-consciously 
scientific approach to language. 
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Harris’ position that context can, in sufficient quantity, substitute for 
meaning has some interesting repercussions if it is generalized beyond the 
realm of linguistic description. It is a general observation that events taken 
out of context are ambiguous, and that their true meaning is usually not 
clear until related facts are known. Does this imply (shades of E. B. Tit- 
chener!) that the context is the meaning? The relation of context to meaning 
is a slippery but interesting one that arises repeatedly in studies of communi- 
cation. Some recent work by Kaplan throws a little light on the relation. 


ABRAHAM KAPLAN 


When a philosopher does psychological experiments in linguistics to solve 
an engineering problem, the interdisciplinary character of modern communi- 
cations research is boldly revealed. As a logician and philosopher, Kaplan is 
familiar with the ambiguities of natural languages. An incentive for studying 
linguistic ambiguity was provided by the problem of engineering a computer 
that could translate mechanically from one language to another. The meth- 
ods Kaplan adopted were purely psychological (14). 

A list of words was presented in isolation and in contexts selected from 
sentences in which the words had occurred. The contexts were: 


Pi one word preceding, 

F1 one word following, 

B1 one word preceding and one following, 

P2 two words preceding, 

F2 two words following, 

B2 two words preceding and two following, and 
S_ the complete sentence. 


Judges were asked to indicate which one of several alternative definitions was 
correct. 

With. no context there were on the average about 5.5 senses that the 
words might have. Context reduced this number. For example, accurate 
judges were able to reduce the number of senses as follows: to 59 per cent for 
P1; 51 for F1; 46 for B1; 48 for P2; 52 for F2; 36 for B2; and to 26 for S. P1 
was least effective. B1 was more effective than P2 or F2. B2 was almost as 
effective as S. 

A short verbal setting can reduce the ambiguity of a word on the average 
from 5.5 possible senses to about 1.5 or 2.0 senses. Most of this reduction is 
contributed by context not more than two words away. Kaplan’s result seems 
to mean that there is a high probability that a B2 context will include some 
substantive word (one of semantic rather than mere syntactic function) that 
will give a clue as to which sense is appropriate. That is to say, the context 
probably acts quite specifically and not by gradual accretion of meaning as 
words are added to the context. 

The engineering significance of these results is that a translating machine 
needs to remember only five words of text at a time. The status of mechanical 
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translation schemes has recently been reviewed by Bar-Hillel (2) and will not 
be pursued here. 


CLAUDE E. SHANNON 


The question of context appears in a different light in the work of Shan- 
non. He considers the problem of a communications engineer who wants to 
send as much information as possible over a communication channel that 
has a finite capacity. If the context of an item tells something about that 
item, then the context effectively repeats the information in the item. This 
repetition of the same information by the symbol and its context is redundant 
and inefficient. In order to use a channel efficiently, an engineer would like 
to remove this redundancy so that every symbol could carry the maximum 
amount of information. 

Shannon’s measure of redundancy is, therefore, a measure of covariance 
between the verbal elements and their contexts, a measure of how predictable 
the future of a message is when only its past is known. It is intuitively ap- 
parent that the more history that is given, the more predictable the future 
can be. That is to say, if you can predict the next letter with a given level of 
accuracy when you know 7-1 preceding letters, then on the average you 
will not predict less accurately if you know 1 preceding letters. In order to 
obtain additional accuracy from the additional unit of context, however, it is 
necessary to know the statistical properties of the message source. In order to 
calculate how much redundancy (or predictability) is introduced by a con- 
text of n-1 preceding letters, you must know the probabilities for all possible 
sequences of length m. As increases, the number of possible sequences goes 
up rapidly and the feasibility of actually measuring all the necessary prob- 
abilities goes down at the same rate. Thus it seems as though the theory of 
information requires a calculation that is impractical, if not impossible. 

Shannon’s most recent contribution to this problem, however, has been a 
short-cut method for estimating upper and lower bounds on the amount of 
redundancy (35). Like Kaplan, Shannon resorted to a psychological experi- 
ment. He avoided the statistical labor that is required in theory by using a 
simple guessing game. His assumption is that people who know a language 
will guess approximately in accordance with those otherwise unknown prob- 
abilities. We will consider one of the bounds that Shannon developed. 

Shannon’s argument can be presented as follows. Imagine identical twins 
who have exactly the same guessing habits about language. The first twin 
sends a message to the second, say, ‘‘There is no reverse on a motorcycle.” 
Now, because their guessing habits are the same, the transmitting twin 
knows that the first guess his receiving brother will make for the first letter 
is ‘‘t.’’ Instead of sending ‘‘t,’’ thérefore, he can send “‘1,’’ meaning ‘‘ Your first 
guess is right.” If the receiving twin is wrong on the first 6 guesses, the trans- 
mitter just sends ‘‘7,”” meaning ‘Your seventh guess is right,’’ etc. By recod- 
ing the original message into guessing numbers, the twins arrive at a new 
sequence of symbols: 
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THERE—IS—NO—REVERSE—ON—A—MOTORCYCLE 
becomes 
EEPSVLZIEL SLEW ALIS 2 AF ATFACLASE De. 


For the twins, the two sequences of symbols contain exactly the same infor- 
mation (one can be recoded into the other, and vice versa), so they must con- 
tain exactly the same amount of information. Note, however, that successive 
symbols in the sequence of digits are effectively independent; the fact that it 
took 6 guesses to get this letter does not say anything about how many guesses 
will be needed for the next letter. The sequence of digits, therefore, has no 
contextual constraints, the context repeats none of the information given by 
the individual digits. It is an easy matter to calculate how much information 
per symbol there is in the digit sequence because no knowledge of probabili- 
ties for long sequences is required. So Shannon computes the amount of 
information per symbol in the nonredundant digit sequence and assumes that 
this is equal to the amount of information per symbol in the redundant se- 
quence of letters. The difference between this estimate of information actu- 
ally encoded and the theoretical calculation of how much could be encoded 
with no redundancy provides Shannon’s measure. It is usually convenient 
to transform into percentages. 

The experiment ran as follows: A sequence of m-1 letters (n=1, 2,---, 
100) was selected at random from English prose writing and presented to a 
judge. The judge read the n-1 letters and then guessed at the n-th letter. If 
he was wrong, he guessed again. He continued to guess until he got the right 
letter. After this procedure had been repeated for several contexts of length 
n-1, Shannon had a frequency distribution for the number of times the first 
guess was right, the number of times the second guess was right, the third 
guess, the fourth, etc. From those frequencies he estimated g;, the prob- 
ability that guess 7 is correct when a context of n-1 preceding letters is known. 
Shannon used only the 26 letters of the alphabet and the space, soz runs from 
1 to 27. The amount of information per letter in the original message. is, 
therefore, 

27 

A, = — DY gi™ logs gi 

tl 
The theoretical maximum information per symbol that could be obtained 
with a 27-letter alphabet is log. 27. The difference between this maximum and 
the estimate, H,, measures the amount of redundancy involved in a context 
of n-1 letters. [Pollack (32) has used this technique in a study of the memory 
span]. 

Translated into percentages, the function Shannon obtained was 
n: 1 2 3 4 #5 6 10 15 100 
Per cent Redundancy: 0 15 25 35 33 45 48 48 68 


Thus 15 per cent of the information conveyed by a letter is also contained 
in the letter immediately preceding it; 25 per cent is contained in the two 
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preceding letters; 68 per cent is contained in the 99 preceding letters. A simi- 
lar but somewhat flatter curve was later estimated by Newman & Gerstman 
(24), by a different method. 

Here again it is found that contextual effects decline rather rapidly with 
distance from the target item. If one remains in the realm of verbal context, 
the redundancy is somewhere between 50 and 80 per cent and most of this is 
contributed by context within a span of about 10 letters. It is also possible, 
however, to supply the nonlinguistic, or situational, context of the communi- 
cation. Frick & Sumby (6) used Shannon's technique to estimate the redund- 
ancy in spoken messages sent from an airport control tower to an approach- 
ing airplane. With only the verbal context, these messages were 80 per cent 
redundant. With situational context added, the redundancy was 96 per cent. 
The tower was sending only four per cent as much information as it could 
have sent with an efficient language. 

In comparing these results with Kaplan's, however, it is necessary to re- 
member that Shannon is interested in how much the context determines 
what the message can be, whereas Kaplan wants to know how much the con- 
text determines what the message can mean. Confusion between the amount 
of information, in Shannon’s sense, and the meaning of the information, in 
Kaplan’s sense, has led to a certain amount of nonsense in the cybernetic 
literature. Carnap & Bar-Hillel have recently endeavored to clarify the dif- 
ference. 


RUDOLPH CARNAP 


Shannon’s measure of the amount of information in a message is the 
negative logarithm of the probability that the message will occur. The less 
probable the message, the more information it contains. Of the two sentences, 
“This is a pencil,’”’ and, ‘‘This is a writing implement,” the first is more prob- 
able to occur in normal, conversational English. Therefore, according to 
Shannon, the first message must contain less information than the second. If 
we examine the meanings of these two sentences, however, Shannon’s meas- 
ure seems wrong. ‘‘This is a pencil’ implies that it is a writing implement, 
but “This is a writing implement’”’ does not say whether it is a pencil, pen, 
typewriter, etc. Semantically, therefore, the first message contains more in- 
formation than the second. 

Obviously, there are two different senses of ‘‘amount of information” 
involved in this example. The engineer’s sense, which ignores meanings and 
concentrates on the probabilities of the different messages that the channel 
will need to transmit, has been beautifully developed by Shannon and his 
colleagues. The application of this engineering measure to psychology has 
been reviewed by Miller (23). The second sense, which concentrates on the 
meanings of the messages, has received far less attention. It is the second 
sense of ‘‘amount of information” that Carnap and Bar-Hillel have tried to 
develop in their Outline of a Theory of Semantic Information (3). 

If sentence X logically implies sentence Y, then the information carried 


XUM 


XUM 


COMMUNICATION 409 


by X includes the information carried by Y, and possibly more. This is the 
necessary condition that Carnap and Bar-Hillel impose upon the concept of 
semantic information. The detailed development of the idea is beyond the 
scope of this review, but some of the obvious consequences can be stated 
here. If X implies Y and also Y implies X, then X and Y contain equal 
amounts of information. A tautology (e.g., ‘‘Today is either Tuesday or it is 
not Tuesday”’) is logically implied by every sentence and so contains no in- 
formation. A logically false sentence (e.g., ‘‘Today is both Tuesday and not 
Tuesday’’) logically implies all other sentences and so contains the maximum 
amount of information—semantic information is not necessarily truthful in- 
formation. The amount of information in the conjunction of two sentences is 
equal to or less than the amount of information in the first plus the amount 
of information in the second (equality holding only if two sentences are logi- 
cally disjunct). 

The theory of semantic information is a ramification of the theory of 
inductive probability developed by Carnap. Psychologists interested in the 
application of information theory to psychology owe themselves a careful 
perusal of this work. If nothing more, it should clarify some of the limitations 
of Shannon’s theory. 

An interesting question for a semantic theory of information is whether 
there is a semantic equivalent to the engineer’s concept of noise. For example, 
if a sentence can have two meanings and if one meaning is intended by the 
talker and the other is received by the listener, then there is a kind of noise 
in the communication even though the physical signals were transmitted 
perfectly. Carnap and Bar-Hillel wish to call this ‘‘semantic noise.’? Kaplan’s 
work, for example, might be characterized as a study of the extent to which 
context reduces semantic noise. In many situations, however, it is difficult or 
impossible to decide whether the receiver’s misunderstanding is due to a 
semantic misevaluation or to a random error in perception or response. The 
question of human errors in communication systems is currently receiving 
close study by psychologists. 


Witiiam J. McGILi 


Communication engineers usually have two human beings in their ideal- 
ized systems, a source and a destination for messages. Between them is the 
equipment that provides the channel. The psychologist’s idealized system 
exactly reverses the positions. The psychologist has equipments that gen- 
erate stimuli and record responses. Between them is a single human being 
who provides the channel. The first clear statement of this inversion was 
supplied by Garner & Hake (7). 

The measure of transmitted information becomes, in the psychological 
application of Shannon’s theory, a measure of the covariance of stimuli and 
responses. The channel capacity defined by the engineer becomes the sub- 
ject’s discriminative capacity. The subject’s behavior upon stimulation is 
recorded and analyzed without reference to what the stimuli mean. This en- 
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gineering approach to the discriminative processes (and even to memory) 
offers considerable promise and is being pursued vigorously by experimental 
psychologists (1, 6, 7, 8, 10, 11, 12, 14a, 14b, 16, 22, 23, 24, 32, 33, 34). 

McGill (22) has recently contributed an important extension to the 
Shannon-Garner-Hake development. He has shown that the measure of 
transmitted information closely resembles analysis of variance. The total 
information in the subject's responses can be analyzed into the sum of sev- 
eral components, one for each stimulus variable used, plus an error (or noise) 
term. In this way an experimenter can show which of the stimulus variables 
influenced the responses. With McGill’s model each stimulus dimension can 
be regarded as a separate (but not necessarily independent) input to the 
subject. For example, it should be possible to tease out Jakobson’s distinctive 
features by applying McGill’s analysis to speech-perception data. 

According to the Shannon-Wiener definition, 


H(x) = — DO pn log pn, 


is the measure in bits of the amount of information per event drawn from 
the discrete population described by the probabilities p,. Only one random 
variable, x, is involved. When two random variables, x and y, are observed 
it is possible to define T(x,y), a two-dimensional quantity, in addition to 
H(x) and H(y). If x is the input to a communication system and y is the out- 
put, then 7(x,y) is a measure of the transmitted information, i.e., of the 
covariance of output with input. By definition, 


T(x,y) = H(x) + H(y) — A(x,y) 


where H(x,y) is computed from p, x, the probabilities for the joint occurrence 
of a given input value with a given output. 

The one- and two-dimensional quantities, H(x) and T(x,y), are well 
known as a result of Shannon’s work. McGill has simply continued the argu- 
ment for more than two random variables. For example, x and y are two dif- 
ferent stimulus inputs and gz is the organism’s response. Then H(x), H(y), 
and H(z) are the amounts of information in the two stimuli and the response. 
T(x,y) is the covariance of the two stimuli, T(x,z) is the transmission of in- 
formation about x, and 7(y,z) is transmission about y. Now it is possible 
to define a three dimensional quantity, 


A(xyz) = — H(x) — H(y) — A(z) + A(x, vy) + A(x,2) + A(y,2z) — H(x,y,2), 


so that all the various quantities of information are combined in a single ex- 
pression. McGill points out that A (xyz) is closely related to the three-way in- 
teraction term in analysis of variance. 

By some algebraic rearrangement it can be shown that 


H(z) = H,z,(z) + T(x,2z) + T(y,2z) + A(xys), 


where H,,(z) = H(x,y,z) —-H(x,y). In this equation H(z), the output or re- 
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sponse information, is analyzed into several components: (a) Hy,(z) is 
‘‘noise,"’ the response information that cannot be attributed to either input 
x or input y; (b) T(x,z) is the response information attributable to input x; 
(c) T(y,z) is response information attributable to y; and (d) A(xyz) is that 
part of the response information that can be attributed to x by virtue of the 
fact that y is also known (or conversely, to y because x is known). This parti- 
tioning of response information into components parallels quite closely the 
analysis of variance into an error term, two main effects, and an interaction 
term. It would seem that information theory effectively corresponds to a 
nonparametric analysis of variance. 

Although most of McGill’s results were foreshadowed in the one and two 
dimensional cases, there is one rather unexpected result. All quantities of in- 
formation with one or two discrete random variables must be nonnegative. 
However, the three-variable term, A(xyz), can be negative. In other words, 
a knowledge of input y may decrease the amount of information that z has 
about x—communication from x to z would actually be better if no data 
about y were collected at all! This interesting feature of informational an- 
alysis is not found in analysis of variance, since variances are always non- 
negative. The A-term is not a measure of information in the sense that H- 
terms and T-terms are. The A-term measures the gain (or loss) in trans- 
mission that results from considering the several stimuli together instead of 
one at a time. 

A communications engineer has ways of measuring the noise that inter- 
feres with his transmissions. A psychologist who tries to apply communica- 
tion ideas, however, has no such independent estimates of the psychological 
noise. All he can say is that some fraction of the response output is signal 
(is attributable directly to the known stimulus inputs). The remainder is, 
presumably, random noise. What McGill does is to break the noise term into 
parts that can be attributed to different sources. If he knows where some of 
the psychological noise comes from, his chances for getting rid of it are con- 
siderably improved. 

In psychological applications it is perhaps a limitation of the engineering 
theory that semantic relations are ignored. The distinction that Carnap and 
Bar-Hillel propose between ‘‘semantic noise’ and ‘‘random noise’”’ is intuit- 
ively appealing, but the Shannon-Garner-Hake-McGill theory contains no 
rules for distinguishing these two kinds of noise. After McGill has taken out 
all the noise that he can correlate with various stimuli, the residual error that 
the person introduced remains untouched. It may be either semantic or 
random. 

Possibly the Carnap theory and semantic noise can be applied to certain 
psychological problems in a manner analogous to the way Shannon's theory 
and random noise are used to study discriminative problems. In fact, Hov- 
land’s ‘“‘communication analysis’ of concept formation (12) suggests one 
direction this application might take. 
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It seems that everyone who studies communication becomes involved in 
problems of meaning and the more involved he becomes, the slower he gets 
ahead. The psychologist who unravels the meaning of meaning will break 
the worst bottleneck in communications research. It is conceivable that 
Osgood may do just that. 

Osgood (28) has proposed a ‘‘mediation hypothesis” based on some early 
Hullian ideas in order to explain how it is that something which is not the 
object becomes a sign of the object. A sign, he argues, evokes a mediating 
reaction that is a fractional part of the total behavior elicited by the object. 
Furthermore, the mediating reaction produces self-stimulation which in turn 
becomes conditioned in varying strengths to different instrumental responses 
or response sequences. This mediating reaction is what Osgood refers to as 
the meaning of the sign. [A more recent elaboration adds grammatical func- 
tions to this purely semantic mechanism (see 19, p. 51).] A sign can elicit 
more than one mediating reaction; it will tend to elicit other mediating re- 
actions in proportion to their similarity to the original reaction, and it will 
inhibit others in proportion to the directness of their antagonism to the ori- 
ginal reaction. 

Whether or not this translation of ‘‘meanings”’ into “‘mediating reactions”’ 
really converts the horse fly into a flying horse need not be decided in this 
review. In the reviewer’s opinion, however, the more significant half of Os- 
good’s contribution is his technique for measuring meanings, a technique 
that grew out of work on synesthesia by way of attitude measurement. Over 
this circuitous path, Osgood arrived at a kind of multidimensional psycho- 
physical method that may be applicable to a wide variety of complicated 
psychological problems. 

The ‘“‘semantic differential” is a combination of association and scaling 
methods. A judge is asked to indicate the position of a concept (e.g., ‘‘paci- 
fist’’) along an experiential continuum defined by a pair of polar terms (e.g., 
“kind-cruel’’). This indication can be made in terms of a seven-point rating 
scale or by measurement of the judgment time required. The judgment is 
repeated by a group of judges for all combinations of several different con- 
cepts and polar continua. 

The polar continua can be regarded as the coordinates of a ‘‘semantic 
space.’’ Each concept is assigned experimentally a value along each coordin- 
ate, and so can be specified as a point in the semantic space. Points that are 
near together in the space should be semantically similar, for they evoke 
nearly the same judgments on all the polar continua. The distances in this 
space are a little tricky, however, for the coordinates are probably not all at 
right angles (that is to say, are not independent). Factor analytic techniques 
can be used to isolate a limited number of general coordinates of the se- 
mantic space that will have the maximum differentiating power. Factors dis- 
covered so far include an evaluative factor (good-bad), a strength factor 
(strong-weak), an activity factor (active-passive), and several specific 
factors. 
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It is no great trick to criticize this technique of measurement, but the 
fact remains that it is the most promising approach yet proposed. Semantic 
conditioning, associative interference, word associations, tachistoscopic 
recognition times, and the like are even more limited in scope than the se- 
mantic differential. For example, Noble’s (25, 26, 27) recent work, in the 
tradition of Glaze, Hull, Krueger, Witmer, etc., is not concerned with the 
measurement of meaning, but with the more limited problem of measuring 
meaningfulness. That is to say, when Noble reports that ‘‘maelstrom’’ and 
“tumbril’’ evoked the same number of associations per minute from his 
subjects, he argues that ‘‘maelstrom” and “‘tumbril’’ have the same meaning- 
fulness. He would not claim that they have the same meanings. For two 
meanings to be the same, the associations should not only be equal in num- 
ber, but identical in content. Osgood proposes to measure the meanings 
themselves, which is a more ambitious and potentially more useful project. 


BENOIT MANDELBROT 


A relation between frequency and rank order, often called ‘‘Zipf’s Law,”’ 
has turned up over and over in statistical studies of language. Zipf formu- 
lated the law as follows: 


pn = Pn-® 


when 1 is the rank of a word when all words are ordered by decreasing fre- 
quency of use, p, is the probability that the word at rank » will be used 
and P and B are constants. Although this relation appears with regularity 
in linguistic data, no one has claimed more than a vague appreciation of its 
cause or significance. No one, that is, until Mandelbrot (20). 

Mandelbrot’s argument is complicated and even an over-simplified 
version requires several steps for its development. The first step is to find 
the assignment of probabilities », to the different words that will make the 
average cost of transmitting a word as small as possible, subject to the side 
condition that the average amount of information per word is a constant. 
There is some cost associated with communication. Let j, represent the cost 
of transmitting the m-th word. Then Dd Prin is the average cost per word. As- 
sume that the cost of each word is given, but the frequencies of use can be 
varied at will by the transmitter. What frequencies will minimize his average 
cost? Obviously, he should always send the cheapest word. This answer, while 
cheap, has the disadvantage that an unchanging message conveys no infor- 
mation to the receiver. It is necessary to impose the auxiliary condition that 
the amount of information, measured by —>o hn log Pa, must bea constant. 

Any mathematician (and many nonmathematicians) can solve this prob- 
lem with a trick devised by Lagrange. He introduces some dummy constants, 
called ‘‘Lagrange multipliers,’’ and comes out with the result that 


pr = PeAin, 


where P and A are constants introduced in order to obtain the solutions for 


Pn. 
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The answer to the first problem, therefore, is that the p, should be se- 
lected so that the cheapest words are used most and the most expensive 
words are used least. The greater the cost jn, the smaller must be the prob- 
ability »,. This answer seems intuitively correct and coincides with the com- 
mon linguistic observation that frequent words are usually short (presum- 
ably cheap) and rare words are usually long (presumably expensive). The 
answer is not completely satisfying, however, because it does not contain a 
clear idea of what the cost might be or how the costs of different words could 
be measured. The second problem, therefore, is to examine the costs jn. 

Mandelbrot considers this situation: The language has many more 
“words” than “‘letters,’’ so each word must be represented by a sequence of 
letters. Furthermore, the communication system that Mandelbrot wants to 
study can remember only one word at a time and must transmit a sequence 
of letters for the present word before it can begin to encode the next word. 
Furthermore, no letter can be transmitted without cost. Some letters may 
be cheaper to send than others, but none are free. 

Under these conditions, the cost of a word is given as the sum of the costs 
of each of the letters in the sequence of letters used to represent that word. 
Words, however, are delimited by spaces (or pauses) that indicate where 
the word ends. To the cost of each word, therefore, it is necessary to add 
the cost of the space that is used to separate it from the next word. Although 
it may seem trivial, the introduction of the space between words is the crux 
of Mandelbrot’s contribution and the main feature that leads him to results 
different from Shannon’s. In Shannon’s problem, the entire message is re- 
membered and then coded in the most efficient form for transmission. In 
Mandelbrot’s problem, the message is remembered only one word at a time, 
so that every time the space occurs the transmitter makes the most efficient 
coding he can of that word and then begins anew on the next word. Obvi- 
ously, a transmitter of the kind Shannon studied will be more efficient, but 
one of the kind that Mandelbrot is studying will be more practical. 

Mandelbrot has shown that, for any assignment of costs to the individual 
letters, the relation between the cost of the word j, and its rank » is given 
(at least to a first approximation) by 


jn = logy n. 


For example, suppose all letters are equally expensive, so that the cost of the 
word is given by its length (plus one for the space at the end), and suppose 
there are M different letters available in the alphabet. Then there are M 
different, one-letter words which are the cheapest. Next cheapest are the M? 
different, two-letter words, then the M® three-letter words, etc. Therefore, 
when all words are ranked from cheapest to most expensive, the rank 7 in- 
creases exponentially as the cost (or length) 7, increases linearly. The fact 
that the same kind of relation holds approximately when the different let- 
ters have different costs is less obvious but no less true. 

The result of the second step in Mandelbrot’s argument, therefore, is that 
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the cost of a word will be given approximately by the logarithm of its rank 
order with respect to increasing cost. This result is true under very general 
conditions and is not a unique outcome of some particular assignment of costs 
to letters. It reflects the two facts: that, in general, if the cost of a word is the 
sum of the costs of its letters, then the more letters it has the more it is apt 
to cost, and that it is not possible to construct as many short, cheap words 
as there are long, expensive words. If these two facts are accepted, then the 
logarithmic relation of cost to rank order is a direct consequence. 

The third step in the argument is to substitute the logarithmic solution 
for jn into the earlier equation relating word probabilities to word costs: 


pn = Pe~Ain = PM~-F in = Pn-® 


And there, mirabile dictu, is Zipf’s law. In fact, Mandelbrot is able to im- 
prove upon Zipf’s law. If the relation of word cost to rank order is solved 
more precisely, the second-order approximation gives 


jn = Jo + logy (n + m). 
Wher this result is substituted into the first equation for pn, 
Pn aad P(n + m)-8, 


where m is another constant. It is clear that m will have the greatest in- 
fluence when the rank m is small—it makes the curve somewhat flatter for the 
commonest words. This modification of the law enables Mandelbrot to de- 
scribe the empirical results much more accurately. 

What does Mandelbrot’s result mean? He says that if one wants to com- 
municate efficiently word-by-word, then one must obey Zipf’s law. There is a 
strong temptation to reverse the implication and to argue that because we 
obey Zipf’s law we must therefore be communicating word-by-word with 
maximal efficiency. The work on the effects of context seems to indicate that 
a speaker’s encoding process involves a longer memory span than Mandel- 
brot postulates for his word-by-word encoder. And the calculations of the 
actual redundancy in natural languages indicate that they are far less effi- 
cient than they could be under the constraints Mandelbrot imposes. The 
question of redundancy is a technical one, however, and the interested reader 
can best pursue it in Mandelbrot’s monograph. 

One interesting suggestion that comes out of this work is a reinterpreta- 
tion of Hick’s data for multiple choice reaction times. W. E. Hick (10) re- 
ported that 


t, = 0.27 log. (n + 1) 


described the relation of disjunctive reaction times to the number » of alter- 
native responses where n ranged from 1 to 10 fingers. For simple reaction 
times, » =1, Hick was led to argue that the brain makes a binary choice be- 
tween the presence and the absence of a signal at any instant. With this as- 
sumption he was then able to say that reaction times are directly proportional 
to the amount of information needed to make the correct response. Mandel- 
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brot was struck by the similarity of Hick’s function to the cost function, 
stated above, when j)>=0 and m=1. That is to say, Mandelbrot thinks that 
the reaction time is a measure of the cost of making the discrimination. This 
thought implies that the disjunctive reaction time can also be used in verbal 
perception experiments as a measure of the cost of a word. Since rare words 
are theoretically more costly than common words, rare words should take 
longer to read, as indeed they do. There is no reason why Mandelbrot’s log- 
arithmic cost function and the reasoning behind it should not be generalized 
to a variety of reaction time and latency experiments. 

Mandelbrot’s thesis takes him into game theory, mathematical statistics, 
thermodynamics, psychology, linguistics, and communication theory. His 
general frame of reference comes from the theory of games; transmitter and 
receiver form a coalition against nature in order to minimize the information 
lost during the communication. In these general terms, the cost of a message 
is the amount of information lost in transmitting it. 


Jacop MArscHAK 


A tie between communication and economic theory has been effected in 
a different way by Marschak (21). Whereas Mandelbrot proposes to con- 
sider information as the currency in a game between men and nature, Mar- 
schak considers information as a commodity that can be bought and sold in 
the market place. 

Marschak’s concern is to generate an economic theory for teams. Here we 
will consider only a small fragment of this ambitious program, the question 
of how much a person or a team should be willing to pay for information. 
The direction of Marschak’s thought can be indicated by some examples. 

A dollar is bet on the toss of a coin. The probability of the coin falling 
heads is known to be ». If the result of the toss is not known, then the best 
bet is heads if p is greater than 0.5, or tails if pis less than 0.5. That is to say, 
over a long series of trials a smart gambler can expect to win pq dollars per 
bet if p>0.5, or g-p dollars per bet if p<0.5. This fact can be expressed 
by writing that his expected value Uy =| p—q|. Now this question can be 
asked: How much should the gambler be willing to pay for information about 
the outcome of a toss prior to placing his bet? If he knows the outcome in 
advance, he can win on every toss, so the expected value now becomes 
U,=1. The difference between his expected values with the information and 
without it, U;— Uo, is the amount that the gambler should be willing to pay 
for the information. In this example, the value of an inquiry is w=1— | p-q| : 

Note that the value of inquiry is similar, but not identical to the amount 
of information (in Shannon’s sense) that the inquiry produces. In this exam- 
ple, the amount of information is given by —p log p—q log gq. Both the 
amount and the value are maximal when pis 0.5 and minimal when pis 0 or 1. 
Therefore, Marschak agrees with Mandelbrot in spirit if not in detail. Both 
theorists treat cost and amount, dollars and bits, as closely related measures 
of information. 
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Marschak’s notion generalizes readily to the value of inexact information. 
Imagine that, in the preceding example, the gambler had the opportunity of 
buying information that he knew was correct only 90 per cent of the time. 
If he buys this information and uses it, his expected value is U;=0.8 dollars 
per bet (in 10 bets, he expects to win 9 and lose 1). Therefore, the value of in- 
quiry becomes U;— Up=0.8— |p—q|. When 4, the probability of heads, is 
less than 0.1 or more than 0.9, the gambler will lose money if he buys the in- 
exact information. Under these conditions he will decide not to buy. In a 
sense, Marschak offers the gambler his choice of two games, both with the 
same opportunities for profit. In one game he gambles on his knowledge of p. 
In the other game, he pays a price to get a safer bet. In the general case, 
the gambler may be permitted to buy information of any degree of precision 
that he chooses. 

The value of inquiry can also be defined for multidimensional situations 
where the event is characterized by several aspects. For example, if future 
weather is one dimension and future prices another, a farmer might want to 
buy information about either or both of these aspects before he decides what 
to plant. If the several aspects of such a complex situation are not independ- 
ent, then there is not so much advantage in knowing both over knowing only 
one. In this way the value measure generalizes to dependent probabilities as 
well. 

A simple example may suggest how these ideas are related to the problem 
of team organization. Imagine that two partners run a shoe factory. Partner 
A is an expert on leather prices, partner B on shoe prices. A can commit the 
firm to buy leather, and B can commit the firm to sell shoes. Now suppose 
that A and B try to run their firm without talking to each other about leather 
prices or shoe prices. Then the only way they can operate is for A to buy 
leather only if the cost is below average and for B to sell shoes only when the 
price is above average. By failing to communicate, the firm will miss the 
chances it has to make a profit when leather is expensive but shoes are more 
so, or when shoes are cheap but leather is cheaper. The profit that the firm 
can expect if the partners operate independently is Uo and the larger profit 
they can expect with two-way communication is U;. Here, as before, w= U; 
— Ugis the value of communication to the firm, the amount that they should 
be willing to spend for telephones, conferences, etc. Cases of considerable 
interest arise if we consider the possibilities of one-way communication, the 
effects of correlation between the two price variables, and the consequences 
of greater variability in one price than the other. 

Marschak’s development of the value of inquiry is in itself a useful con- 
tribution to communication theory, and the possibilities it suggests for opti- 
mizing team organizations look quite promising. However, all the stock 
criticisms raised by psychologists against normative economic theory can 
also be raised against the type of work represented by Marschak’s paper. 
Perhaps social psychologists will be more grateful for a normative approach 
to groups than their experimental colleagues have been for a normative ap- 
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proach to choice and motivation. A purely descriptive science of group be- 
havior is certainly a tedious alternative. 


CONCLUSIONS 


Perhaps the reader of this review has been struck by a persistent and re- 
curring theme in nearly all the recent work reported here. This theme has to 
do with the question of context. In linguistics there is a real division of opin- 
ion about how far context can be used to replace meaning in linguistic an- 
alysis. Kaplan wanted to relate context and ambiguity. Shannon wanted to 
relate context and redundancy. McGill resolved psychological noise into 
components by a careful examination of contextual stimuli in the experi- 
mental situation. Mandelbrot examined the effect of limiting the amount of 
context used during encoding to one word at a time. After years of neglect, 
verbal context has now become a very popular problem. 

A second theme that runs through most of the recent work is an old fav- 
orite. The treatment of meaning is an important aspect of the work by Ja- 
kobson, Harris, Kaplan, Carnap, and Osgood. 

Finally, a third trend in this work is reflected in the many attempts to 
optimize something. Kaplan wants to minimize ambiguity, Shannon wants 
to minimize redundancy, McGill wants to maximize transmission, Mandel- 
brot wants to optimize the encoding procedure, Marschak wants to find the 
best rules for communication within teams. This normative approach is at- 
tributable in part to engineering, in part to economists, and it makes many 
psychologists uncomfortable. 
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basic varieties of, 240 
carbon dioxide inhalations, 
269 

experimental neuroses and, 
263-64 

new stimulus situations 
and, 240 

criterion analysis, 165-66 

diagnosis 
multiple psycb’atric, 291 
new concepts, ““4-66 
psychologic factors in, 

274-76 
research studies in, 267 
standard clinical tools 
and, 274-75 

etiology 
new concepts of, 264-66 

experimental investiga- 

tions, 263-64 

factor analysis of, 131-32 

fear 
drugs on, 263 

fixations concept, 43, 242- 

44, 263, 264, 319 
frustration tolerance and, 
319 

hypersomnolence, 255 

hysterical deafness, 258 

as interpersonal incompe- 

tence, 318 

interviewing, 266-67 

learning difficulties, 17-18 

leucotomy on, 316 
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manic depressives 
biochemical factors in, 256 
criterion analysis of, 165 
manual of, 264-66 
mental defects 
assessment of, 370 
glutamic acid on, 137 
Stanford Binet test of, 370 
statutory definition of, 19, 
370 
military psychoses, 271 
in military settings, 270-72 
civil defense and, 272 
combat neuroses, 271 
postcombat neuroses, 272 
prisoner of war reactions, 
271 
selection and performance, 
270-71 — 
therapy, 271-72 
neuroses, 264-72 
accident proneness, 16-17 
ACTH on, 269-70 
biociemical factors in, 256 
cortisone on, 269-70 
drug therapy, 269 
etiology of, 268 
experimental, 263-64 
Freud’s concept of, 319 
hormonal therapy, 269-70 
hypnotherapy, 270 
interviewing, 266-67 
Méniére’s disease and, 268 
in military settings, 270-72 
new taxonomy of, 265-66 
nosology of, 2°4-é6 
physiologic stuciies, 267 
postcombat, 27: 
prevention of, 265-69 
psychological studies, 267 
psychzinerapy of, 270 
Rorschach test of, 258 
social origins, 318 
surgery on, 16 
therapy of, 269-70 
nosology of, 264-66 
photogenic epilepsy, 260 
physical handicap and, 18 
physiologic correlates, 263 
physiologic studies, 267 
problem behavior, 14-15 
psychologic studies, 267 
psychopaths 
psychological tests for, 291 
psychotherapy for, 269 
and social deviation, 269 
psychoses, 272-77 
classification of, 275 
diagnosis, 274-76 
differential diagnosis, 273 
dynamics and performance, 
273-74 
factor analytic studies, 273 
group therapy, 329 
in military settings, 271 
normality and, 131-32 
process oriented research, 


272-74 
prognosis, 276, 298 
research trends and omis- 
sions, 274 
social evaluations, 276-77 
therapy, 274-76 
psychosomatic dysfunctions, 
267-68 
Sydenham’s chorea on, 14-15 
therapy 
comparative effects of, 
275-76 
evaluation of, 275-76 
group, 276-77 
hospital attitudes on, 277 
psychologic factors in, 274 
see also Behavior; Behavior- 
al development; Brain 
damage; Delinquency; 
Psychosomatics; Psycho- 
therapy; and Schizophren- 
ia 


Behavioral development 


*fabsolute’’ theory, 2 
art appreciation 
color on, 362 
maturation change, 362 
realism on, 362 
sex on, 362 
basic concepts of, 1-4 
castration concept, 4 
descriptions of, 1-4 
fixation, 3 
Gesell-Ames Schedules, 2 
Gestalt theory, 2 
libidinal, Freudian theory of, 
testing of, 3-4 
Oedipus complex concept, 4 
phylogenetic law, 1-2 
reflexes 
disappearing, 2 
monosynaptic stretch, 2 
plurisynaptic cutaneous, 2 
regression, 3 
stages (Freud), 3-4, 164 
tendencies 
intercorrelations among, 


see also Adolescence; Be - 
havior abnormalities; 
Child behavior; Child psy- 
chology; Children; Infant 
behavior; Learning; Matur- 
ation; Personality; Person- 
ality, structure; and 
Speech, development of 


Brain 


ACTH on, 257 


damage 
agnosia and, 259-60 
alexia and, 259-60 
on behavior, 251-55 
chronic lesions, 255 
Maier String Problem, 260 
satiation effects and, 259 

leucotomy, 316 

lobotomy 





436 


on behavior, 254 
peripheral sensory stimula- 
tion, 254-55 
prefrontal lobectomy 
on behavior, 254 
transorbital lobotomy 
on olfactory acuity, 119- 
20 


Cc 


Chemical senses, 111-24 
gustation, 111-17 

individual differences in, 
113-15 

in insects, 115-16 

interval scale for sweet- 
ness, 116 

learning of, 116-17 

neurophysiology of, 111- 
13 

pathway of, 111-12 

receptor sensitivity in, 
113 


see also Taste 
insect studies, 115-16, 121- 
22 
interest in field, 111 
odor 
character of, 121 
and chemical constitution, 
121 
chemical reduction of, 120 
chlorophyll and, 122-23 
deordorant capacity, 120 
industrial problems with, 
122-24 
intensity of, 121 
research problems in, 111 
strength of, 122 
olfaction, 117-24 
adaptation exploration, 121 
air conditioning problems, 
122 


amplitude versus frequency- 
modulation, 119 

apparatus, 120 

background potentials, 117- 
18 


chemical action of chloro- 
phyll, 123 

chlorophyll as deodorant, 
122-23 

classification system of, 
111 


direct system, 117 

electrical recording activ- 
ity, 118-19 

fatigue on, 120 

individual differences in, 120 

industrial problems in, 122 

insect studies in, 121-22 

masking thresholds in, 121 

nature of, 119 

neural responses in, 119 

neuroanatomical interac- 
tions, 117 
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neurophysiology of, 117-20 
physiological interaction, 
123 


on psychophysiological 
functions, 123-24 
in psychotics, 119-20 
rejection threshold in, 
121-22 
in sex and, 123 
spatial distribution on, 118 
stimulation in, 120 
thresholds in, 118, 119-20 
transorbital lobotomy on, 
119-20 
ventilating problems, 122 
vitamin A deficiency on, 
123 
olfactometers, 111 
reviews of, 111 
see also Sensory processes 


Child behavior 


age trends, 6 
aggressiveness 
biologic, 268 
ego disturbances and, 15 
frustration on, 16, 162, 
268 
mother attitudes on, 9 
neurotically hostile, 268 
punishment on, 3 
developmental trends, 4-6 
deviant, see Behavior abnor- 
malities 
family attitudes on, 8-11 
family practices on, 8-11 
feeding practices on, 10 
hospitalization on, 16, 268 
ordinal position on, 11 
parent behavior and, 3 
punishment on, 3 
sex differences in, 12-13 
sibling influence on, 11 
social class on, 6-8 
social-emotional 
family on, 8-9 
trait persistence, 4-6 
see also Adolescence; Be- 
havioral development; 
Child psychology; Child- 
ren; Infant behavior; and 
Personality 


Child psychology, 1-22 


behavior development, 1-4 
biosocial complex, 13 
cognitive processes, 19-22 
as deductive science, 3 


deviant behavior and attitudes, 


14-19 

evaluation of, 1 

general characterization of, 
1 


mental measurement, 19-22 
peer relations, 11-12 
personality organization and 
development trends, 4-6 
sex differences, 12-13 
social class differences, 6-8 





see also Adolescence; Be- 
havior abnormalities; 
Behavior development; 
Child behavior; Child- 
ren; and Infant behavior 
Children 
adjustment of 
physical handicaps on, 18- 
19 
social class on, 6-7 
appreciation development in, 
362 
attitudes of 
social class on, 6-7 
bilingualism in, 21-22, 138- 
39, 372 
concepts of 
social class on, 7-8 
ethnic attitudes of 
comparative study of, 195 
family on, 10-11 
generalized prejudice fac- 
tor in, 194 
sex on, 13 
interests of, 366 
language of, 8 
memory in 
social class on, 7 
mentally retarded, 19, 370- 
71 
problem, 370-72 
nail biting in, 17 
parental goals for, 7 
peer judgments of, 12 
peer relations of, 11-12 
external characteristics on, 
11 
gauges of, 11-12 
performance on tests, 21 
physical handicaps of 
cardiac, 1 
cerebral palsy, 19 
social goals of 
family on, 10 
of, 2-3 
visual acuity in, 66-67 
see also Adolescence; Beha- 
vioral developments; Child 
behavior; Child Psychology; 
and Infant behavior 
Cochlea 
anatomy of, 90-91 
dimensions of, 90-91 
electrical dynamics of, 91-92 
frequency resolution of, 90 
innervation, organ of Corti, 
80 


jet engine noise on, 234, 249 
lesions, 92, 100-1 
mechanical dynamics of, 89- 
91 
microphonic 
noise on, 95-96 
spatial origin of, 91 
phase contract microscopy of, 
90 
tectorial membrane, 90 











*‘tuning’’ of, 97 
see also Deafness; Ear; and 
Hearing 
Communication, 401-18 
administrative, 222 
advertising variables, 220 
amount of information, 408 
channels 
context and, 406-8 
redundancy problem, 406- 
8 


comic strip in, 190 

contexts, 406-8 

economic theory and, 416- 
18 

international propaganda, 
190 

language 

ambiguity, 405-6 

basic phonemes, 404 

frequency and rank order 
relation, 413-16 

meaning, 404-5 

+ segments of, 403- 


translating machine, 405- 
6 


Zipf’s law, 413-16 
meaning 
measures of, 412-13 
mediation hypothesis, 412 
a me differential, 412- 
1 
meaningfulness, 413 
mediation hypothesis, 412 
“random noise, ’’ 411 
redundancy problem, 406-8 
semantic differential, 412 
semantic information 
theory, 408-9 
“‘semantic noise,’’ 409, 411 
semantic space, 412 
speech analysis 
binary notation for 
phonemes, 402 
distinctive features cata- 
logue, 403 
distinctive features theory, 
401-2 
— techniques, 


engineer’s approach, 402- 
3 
theory and research in, 189- 
90 


transmitted information 
measure of, 409-11 

value of inquiry, 416-18 

within teams, 416-18 

Zipf’s law, 413-16 

see also Attitudes; Group 
processes; Social psy- 
chology; and Speech 

— psychology, 229- 


abnormal behavior fixations, 
242- 44 
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audiogenic seizures, 234- 
37 


behavior 
disorder studies, 239-44 
early environment on, 231- 
33 
electroshock convulsions 
on, 237-39 
see also Animal psychology, 
behavior; Behavior; and 
Psychotherapy 
conceptual trends in, 229 
conflict studies, 239-44 
experimental psychogenetics, 
230-31 


homologous behavior pheno- 
mena, 229 

sensory processes, 233-34, 
248-50 

social psychology, 173 

ry reaction studies, 241- 


see also Animal psychology; 
and Physiological psy- 
chology 
Conditioning 
see Learning 
Counseling 
on academic performance, 
348 


approaches to, 340 
client motivation in, 342 
“counseling psychologist, ’’ 
337-38 
counselors 
APA code for, 338 
characters of, 338 
in counseling process, 339 
education of, 338 
in-service development, 
338 
interpretations of, 349 
selection of, 338 
training of, 338 
counselor-client relation- 
ship, 340 
of delinquents, 323 
ethical standards, 338 
evaluation 
basic conditions of, 342 
criteria, 342 
—_ participation scale, 


verbal participation criter- 
ion, 342 
integration of, 341 
‘“*integrational, ’’ 339 
interest patterns 
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marriage, 349 
methods, 337-51 
occupational information, 
349-51 
expert-opinion analysis, 
350-51 
Shartle’ basic book on, 
350 


outcome 
preparation for, 348 
procedures in, 348-49 
process 
counselor in, 339 
ego-building, 339 
learning theory and, 339 
vocational development 
theory, 339 
as profession, 337-38 
education, 338 
progress of, 337 
psychological measurement 
in, 343-48 
of academic efficiency, 345 
aptitude tests, 346 
college grades, 348 
interest measurement, 343- 
44 
personality measures, 344- 
46 


prediction studies, 346-48 
of study habits, 346 
psychotherapy and, 337 
research and evaluation 
methodology, 341-43 
research methods in 
advance in, 343 
analysis of, 341 
basic essentials, 341-42 
common theories of, 341 
design development, 341 
self-concept theory, 340 
specific procedural elements, 
348-49 
on study habits, 342 
systematic formulations and 
theory, 339-41 
vocational choice, 349-51 
compromise process, 339 
fantasy-reality on, 349-50 
Ginsberg theory of, 349 
limitations of, 339 
parental occupation and, 350 
ten-point theory, 339 
see also Psychotherapy 


D 


Deafness 


academic achievement and, articulation-gain function and, 
343-44 103-4 


college student experiences 
and, 344 
differences in, 344 


auditory fatigue and, 101-2 
see also Hearing, auditory 
fatigue 


‘‘medical-men-in-general,’’ auditory nerve, 99-100 
343 


interviewing 
dynamics of, 349 


cochlear or scalar 
effects of, 100 
loudness recruitment and, 
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98-99 
Méniére’s disease, 98-99 
diagnosis of 
psychophysical tests for, 
98 


eighth nerve, recruitment 
and, 99-100 
fenestration operation on, 
105-7 
hysterical 
physiological approach to, 
258 


intelligence testing and, 371 
measurement of, 89 
Méniére’s disease 
articulation-gain function 
and, 103 
auditory fatigue and, 101- 
2 


loudness recruitment in, 
99 


symptoms of, 98-99 
peripheral, 98-105 
loudness recruitment and, 
98-99 
tests for, 102 
type of, 98 
sensori-neural, 98-105 
articulation-gain function 
and, 103 
cause of, 105 
theory, 105 
speech intelligibility and, 
103 


transmission 
articulation-gain function 
1 
auditory fatigue and, 101 
see also Cochlea; Ear; and 


Hearing 
Delinquency, 15-16 

aggression in, 15 
counseling and, 323 
identification scores and, 163 
intelligence and, 371 
Minnesota Multiphasic Per- 

sonality Inventory, 301 
multiple therapy, 327-28 
personality characteristics, 

15 


prediction of, 301 
psychotherapy and, 323 
in social development pro- 
cess, 371-72 
symptoms of, 293 
vandalism and, 15-16 
see also Adolescence; and 
Behavior abnormalities 
Differential psychology, see 
Individual differences 
Drive 
anxiety as, 35, 47-48, 156 
discriminating, 37 
on habit strength, 151 
intensity of, 34-35, 40-41 
irrelevant, 35, 41 
performance and, 40-42 
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in serial rote learning, 152 

Strong Vocational Interest 
Blank and, 344 

type of, 34-55, 41-42 

see also Learning, motiva- 
tion in; and Motivation 


E 


Ear 
anatomy of, 89-92 
Brownian motion in, 92-93 
fenestration operation on, 
105-7 
free-field versus canal 
pressures, 93-94 
labyrinthine hydrops, 104 
medial geniculate body 
neural activity of, 91-92 
middle ear infections 
audiogenic seizures and, 
235-36 
physiology of, 89-92 
sensory epithelium 
streptomycin on, 92 
see also Chochlea; Deafness; 
and Hearing 
Education 
academic success prediction, 
299-300, 366 . 
arithmetic, 362 
bilingualism in, 372 
college reading, 360-71 
group therapy on, 360 
rate of, 360-61 
remedial, 360 
reversals, 360-61 
elementary school reading, 
359-60 
background music on, 360 
book illustrations on, 361 
group teaching, 359 
radio listening and, 359-60 
teaching methods, 359 
textbooks on, 359 
elementary school subjects 
research in, 357-58 
emphasis in, 357-58 
of highly intelligent, 371 
language, 362 
* mentally defective, 370- 
1 
performance 
counseling on, 348 
problem children 
aphasia, 371 
intelligence, 370-71 
sensory defects, 371 
professional school success 
prediction, 299-300 
reading, 358-61 
research results on, 357 
social differences in, 368 
spelling, 362 
teachers 
attitudes of, 369-70 
behavior factors, 293 





competence of, 369, 370 
participation in research, 
358 
pupils’ evaluation of, 300 
see also Educational psy- 
chology; and Learning 
Educational psychology, 357- 
73 


academic success 
prediction of, 366-67 
social differences on, 368 
appreciation development, 
362 
attitude studies, 365-66 
children’s interests, 366 
college reading difficulties, 
360-61 
definition, 357 
delinquency, 371-72 
elementary school reading, 
359-60 
general learning ability 
363 


guidance and selection, 366- 
68 
intelligence tests in, 367- 
68 


social class in, 368 
individual differences, 366 
intelligence tests validity, 

367-68 
practice teaching, 370 
problem children, 370-72 
progress measurement, 362- 
63 


promotion by age, 364 
race and language, 372 
reading skills 
background music on, 360 
emotional bias on, 360 
group methods on, 359 
radio listening on, 359-60 
school curriculum and, 361 
word recognition improve - 
ment, 359 
research and practice, 357- 
58 
evaluation of, 357-58 
*“follow-up’’ research, 358 
‘frontier’’ research, 358 
group relationships, 357- 
58 
teacher participation in, 
358 
school subjects, 358-63 
social aspects, 363-66 
social relationships 
constancy of, 365 
studies in, 364 
“‘sociograph, ’’ 365 
sociometric status 
emotional adjustment and, 
264 


pupil assessment of, 364 

pupil awareness of, 364 
sociometric studies, 363-65 
staff relationships, 363 











-XUM 


teacher attitudes 
to criticism, 357 
teacher-researcher liaison, 
358 


teachers, 369-70 
teaching competence, 369, 


see also Education; and 
Learning 
Electroshock convulsion 
on behavior, 237-39 
Emotion 
conditioned responses 
electroshock convulsions 
on, 239 
on sociopolitical attitudes, 
191 


Ethnocentrism, see Prejudice 
Extinction 
anxiety on, 35 
deprivation on, 35 
Hull’s doctrine of 
objections to 28 
intertrial intervals on, 39 
reinforcement and, 38-39 
resistance to, 44-45 
**toleration method’? of, 
42-43 
verbal forgettings, 46 
see also Learning; and Re- 
tention 


F 


Factor analysis, 389-91 
basic objective of, 127 
basic traits, 166-67 
closure factors in 

“flexibility of,’’ 129-30 
“‘speed of,’’ 129-30 
temperament variables 
and, 130 
‘‘creative thinking, ’’ 130 
criterion analysis, 165-66 
of criterion variables, 133 
evaluative abilities, 130 
in experimental psycho- 
pathology, 273-74 
“‘factor,’’ meaning of, 128 
factor structure hypothesis, 
390 


group factoring methods, 
389-90, 391 

hierarchical organization of 
cognitive ability, 129 

high speed computers, 391 

of individual differences, 
127-28 

in industrial psychology 
tests, 205 
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of occupational criteria, 133 

in perceptual domain, 129- 
30 

of personality measures, 
166- 

of personality variables, 
285, 286 

of political activity compo- 
nents, 193 

of predictor variables, 134 

of primary mental abilities, 
129 


of projective tests, 285 
in psychological theory, 127- 
28 


of psychotics, 131-32 

of reasoning abilities, 129 

reasoning factor, 134 

of sales success criteria, 
214 

score matrices factoring, 
389-90 

of spelling test, 362 

of streetcar drivers, 211 

of Study of Values, 345 

termination of, 389 

test equivalents of persons, 
390 


theoretical difficulties, 389 
Wechsler-Bellevue Test 


‘‘when to stop, ’’ 389 
of work interests, 132 
see also Statistical methods 
Fear 
acute 
blood sugar level, 263 
gastric acidity, 263 
chronic 
blood sugar level, 263 
gastric acidity, 263 
sodium amytal on, 263 


G 


Genetic factors 
see Behavior, genetic factors 
on 
Group processes, 171-98 
conceptual formalization, 173- 
74 


frustration-aggression se- 
quence, 179 
group behavior 
deindividuation, 177 
individual-members in- 
fluence on, 176 
psychology of, 329 
research methods in, 329 
group formation 


job analysis schemes and, 206 determinants of, 176-77 


matrix equations, 389-90 


of mechanical comprehension 


battery, 138 
methods controversy, 127 
multiple group factor com- 
putation, 140 


success and failure on, 176- 
77 
group learning 
definition of, 175 
productivity and, 179 
variables in, 175 
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group productivity, 179-81 
abilities on, 180 
attitudes and, 219 
in creative activity, 180- 

81 


in critical activity, 180-81 
differential skills on, 180 
experience on, 180 
factors of, 179-80 
goal on. 180 
leadership in, 180 
learning and, 179 
predictor variables, 180 
sociometric regrouping 
procedure on, 180 
group relationships, 175-76 
group structure 
determinants of, 176-77 
group teaching, 359 
group therapetuic orienta- 
tion, 276 
group therapy, 228-32 
influence process, 181-83 
audience in, 182-83 
group-anchored attitudes 
and, 1 
group-derived restraints, 
182 


individual differences, 182 
on individual judgment, 181- 
82 
leadership on, 183 
variables in, 181 
intragroup tensions, 177-78 
communication in hostility 
reduction, 178 
family, 177-78 
“‘stranger,’’ 178-79 
job selection 
group methods for, 212-13 
mathematical models, 173- 
75 
graph theory, 174-75 
for group structure, 174 
for participation distribution, 
174 
models and methods, 173-75 
opinion change 
leadership on, 183 
psychotherapy, 272, 317-18 
social group concept, 175 
social psychology and, 171-98 
social therapy, 276-77 
theory of groups, 175-76 
see also Attitudes; Communi- 
cation; Leadership; Social 
behavior; and Social 
psychology 


Gustation 


see Chemical senses, 
gustation; and Taste 


H 


Hearing, 89-108 
absolute sensitivity, 92-93 
anatomy of, 89-92 
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articulation-gain function, 
103-4 


articulation index, 89, 104 
attributes of 
psychophysics of, 92-96 
audibility curve, 93 
audiometer standardiza- 
tion, 93 
audiometry, 107-8 
auditory fatigue, 96-98 
acoustic trauma, 96-97 
adaptation, 96-97 
deafness and, 101-2 
discussion of, 89 
recovery from, 97-98 
short-duration, 97 
see also Deafness, audi- 
tory fatigue and 
auditory flutter, neural 
mechanism of, 95 
bone -conduction audio- 
metry, 106 
Brownian motion, 92-93 
cochlear lesions and, 92, 
100-1 
cortical lesions and, 92 
critical band and, 89 
difference limen determin- 
ation, 100-1 
fenestration operation 
audiometric changes fol- 
lowing, 105-7 
hearing loss measure- 
ments after, 105-6 
instrumental avoidance 
conditioning and, 106 
free-field versus ear can- 
al pressures, 93-94 
frequency discrimination 
cortical lesions on, 92, 
252-53 
measurement of, 95 
pitch, 95 
intensity discrimination 
cortical lesions on, 92, 
252-53 
loudness of noise, 95-96 
loudness recruitment, 89, 
98-99 
auditory fatigue and, 102 
cochlear lesions and, 100 
detection of, 100-1 
hyperrecruitment phenom- 
enon, 102-3 
measures of, 99 
speech and, 103 
masking in, 94-95 
pure tones and, 94 
self-noise, 94 
in mice, 234, 249-50 
‘“‘missing 6 db,’’ 93-94 
Paracusis Willisii, 107 
physiology of, 89-92 
space-time pattern theory 
of, 89 
speech analysis, 107 
speech tests for, 107 
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thresholds in 
Brownian motion and, 
92-93 
measured stimulus energy 
and, 92-93 
measuring of, 93 
psychophysics of, 92-96 
variability in, 102 
tinnitus aurium 
in the deaf, 107 
in Méniére’s Disease, 98- 
99 
in normal ear, 107 
volume of noise on, 96 
see also Cochlea; Ear; and 
Deafness 


I 
Individual differences, 127- 
42 


in academic grades, 133 
in adaptability, 133 
age on, 136-37 
in athletic proficiency, 133 
in complex tasks, 133 
dimensions, 128-35 
of ability, 129-30, 139-40 
of attitudes, 132-33 
criterion measures, 133- 
34 
experimental studies of, 
134-35 
identification techniques, 
139 
of interests, 132-33 
of personality, 130-32 
problems of, 127 
on egalitarian educational 
policy, 366 
ego threat and, 167 
ethnicity and, 136 
factor analysis of, 127-28 
general cognitive ability, 
129 


in group pressure responses, 
182 


in gustation, 113-15 
intellectual ability, 129 
masculinity-femininity, tests 
of, 131 
measurements of, 127-28 
in occupational pursuits, 133 
in olfaction, 118, 120-21 
in perception, 129-30 
in practical ability, 129 
in primary mental abilities, 
129 
problems in field of, 127 
psychometric instruments, 
141-42 
new tests, 141 
test evaluation, 141-42 
test standardization, 141- 
42 


psychometric methodology, 
139-41 





ability tests, mathemati- 
cal theory of, 139-40 

factor analysis of, 140-41 

multiple discriminant 
function, 141 

scaling theory, 139 

test construction problems, 
140 


psychomotor tests of, 130 
in psychosis, 131-32 
in reasoning ability, 129 
socioeconomic status on, 
137-38 
specific training on, 138 
stability of, 135-36 
variables in, 135-39 
environmental, 137-39 
genetic, 136-37 
physiological, 136-37 
see also Personality 
Industrial psychology, 203- 
25 
absence, 217 
accidents, 217 
acquaintance ratings, 214 
application blanks, 209-10 
aging, 216 
attitudes and morale, 219- 
20 
criteria, 213-14 
criterion problem, 204 
European work, 205 
statistical methods, 205 
subjective evaluation, 205 
vocational guidance, 205 
fatigue measuring, 217 
hiring and training costs 
tests on, 384 
indications, 224-25 
interviewing, 210-287 
job evaluation, 206-7 
factor analysis and, 206 
office jobs check list, 206 
job performance criteria, 
213-14, 293 
job satisfaction 
personality on, 215 
proficiency and, 215-16 
studies of, 215-16 
variables, 215-16 
joint consultation, 221-22 
leaderless group discussion, 
212-13, 293-94 
leadership and management, 
220 -24 
attitude changing, 223 
education for, 222 
group leadership, 221 
self-awareness, 222 
training program and, 222- 
23 


see also Leadership 
morale 
efficiency and, 216 
objectives of, 203-4 
occupational adjustment cri- 
teria, 214-15 








payment systems, 206 
personality factors and, 
208 
predictor tests 
of industriousness, 212 
on medical students, 210- 
11 
physical profiling, 211- 
12 


‘‘personality’’, 212 
of sales success, 293 
structural dexterity, 211 
of streetcar drivers, 211 
turnover and, 210 
rehabilitation, 216 
sales performance 
prediction of, 298-99 
sales success criteria, 214 
selection and placement, 
209-13 
clinical approach, 210 
of engineers, 293 
group selection methods, 
212-13 
interviewing, 210 
personality inventories, 
212 
physical characteristics 
category, 351 
pilots, 292 
tests, 210-12 
Statistical knowledge and, 
204 
supervision, 217-19 
evaluation procedures of, 
217-18 
supervisors 
ability rating of, 218 
behavior dimensions, 293 
handbook for, 218-19 
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time study, 207 
of traveling time, 207 
see also Communication; 
and Group processes 


Infant behavior 


family on, 9 


feeding difficulties and, 
1 
see also Child behavior 


IQ 


coaching on, 367 

economic status on, 368 

family on, 9 

limitations of, 368 

practice effects, 367 

social relations on, 368 

special abilities and, 371 

Stanford Binet, 19-22, 142, 
370-71 

Wechsler Intelligence Scale 
for Children, 19-22 

see also Tests, intelli- 
gence 


K 


Kuder Preference Record 


interest patterns and, 344 
job satisfaction and, 300 
situation tests and, 295 
studies of, 132 

Study of Values and, 132 
on supervisors, 218 

in vocational guidance, 209 


L 


Latent learning, 37-38 


drive discrimination in, 37 
electroshock convulsions 
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Rorschach test, 294 

skills transfer, 220-21 

see also Industrial psy- 
chology, leadership and 
management 


Learning, 27-51 


“alimentary’’ conditioned 
responses, 116-17 
anxiety on, 17-18, 35, 
151-55 
see also Anxiety, in 
le 
behavior support in, 35-36 
classical and instrumental 
conditioning 
expectancy acquisition in, 
34-40 


performance in, 40-45 
conceptual analysis, 29 
configurational, 31-32 
deprivation on, 35 
disappearing reflexes and, 

2 


discrimination, 29-34 
compound stimuli on, 31 
conditions of, 29-31 
early learning on, 29-30 
brain damage on, 259 
in laboratory animals, 31, 

253-54 
motivation in, 34 
patterns of, 31-32 
reversal, 32-33 
secondary reinforcers in, 
30 
simultaneous, 32 
of size, 31-34 
stimuli effectiveness in, 
31 
successive, 32 


objective assessment of, on, 238-39 training on, 30 
217 “general maze-wiseness’’ see also Visual perception 
training of, 218-19 and, 37-38 discrimination-learning 


textbooks in, 206 
training procedures 
evaluation of, 218 
role playing test, 218 
trends in, 203-5, 224-25 
vocational guidance, 207- 
9 


interests assessment, 


irrelevant incentive in 
spacial position on, 37 
prefeeding place on, 38 
relevant incentive in 
motivation degree for, 37 
unrewarded preliminary 
exploration on, 38 


Leadership, 220-24 


sets, 33-34 
stages in, 33 
“transfer suppression,’”’ 
33 
drive strength in, 27 
electroshock convulsions 
on, 237-38 
empirical attitude toward, 


208-9 behavior 362-63 

occupational choice description of, 184 expectancy in, 34-40, 43- 
theory, 208 test for, 184 44 

personality inventory fak- critical prediction, 294 alteration on, 44 
ing, 208 differences in, 221 


sampling problems, 209 
validation problems, 
207-8 
work adjustment 
personality and, 288 
work environment, 216-17 
worker attitudes survey, 


group, 221 
group influence and, 183 
group productivity and, 180 


leaderless group discussion, 


212-13, 293-94 
measurement and training, 
183-84 


basic variable governing, 
34 


discrimination process 
and, 2 

drive and, 40 

sy discrimination and, 


incentive value and, 40 


219-20 personal factors in, 184 postulates for, 27 
work measurement, 206-7 potential assessment of, stimulus-response and,28 
manipulative time, 207 184 fixations habit concept, 43, 
motion analysis, 207 responsibility scales, 221 242-44, 263 
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frustration-aggression se- 
quence, 161-62, 179 

general ability, 363 

generalization studies, 
42-43 

greco-latin research de- 
signs, 392-93 

group superiority in, 179 

habit strength in, 27-28, 
30, 32 

anxiety and, 47-48, 151- 

52 


tests of propositions about, 
134 


Hullian theory of, 27-28, 
152, 154, 241, 263 
incentive motivation in, 
27-28 
intertrial intervals and, 39 
motivation 
factors in, 48 
insulin-induced hunger 
as, 320 
kinds of, 17-18 
tests of, 17-18 
motor, 49-51 
distribution on, 46-47 
practice distribution, 
49-50 
reminiscence on, 46-47, 
50-51 
transfer of, 51 
multiple role playing tech- 
nique, 183-84 
perceptual training ex- 
periments, 30 
performance 
adrenocortical activity, 
257 


anoxia on, 231-32, 248 

alternation in, 44 

alternative expectancies 
in, 43-44 

brain damage on, 251-54 

determinants of, 40-45 

drive and, 40-42 

drugs on, 231 

electroshock convulsions 
on, 238 

fixation on, 43, 242-44, 
263 

of mentally ill, 273 

pointer-reading respon- 
ses, 44-45 

regression in, 44 

retroactive inhibition on, 
51 


x-irradiation on, 231, 248, 
250 
see also Animal psycholo- 
gy, learning performance 
by psychotherapists, 327- 
28 


reaction potential 
equation for, 27 

reading 

perceptual aspects, 359 
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perceptual training on, 
360 
reinforcement in, 263 
relational, 31-3: 
restraint on, 36 
reversal-learning set, 
32-33 
“‘rigidity’’, 33, 134-35, 
273-74 
of schizophrenics, 319-21 
**stimulated recall’’ 
studies, 17-18 
stimulus in 
intensity of, 35 
type of, 35 
single-trial S-R contiguity, 
28 


space on, 36 
stimulus-response theory 
analysis of, 29 
expectancy and, 28 
terminal-stimulus in 
frequency, 38 
incentive value of, 36-37, 
40 


39 
quality of, 36-37 
reinforcement, 38 
theory, 27-29 
in counseling, 339 
psychotherapy and, 318-19 
time delays on, 39 
time intervals on, 35-36 
verbal, 45-49 
anxiety on, 47-48 
association disturbances 
on, 48 
distribution on, 46-47 
failure on, 48 
familiarity on, 45 
law of effect in, 47-48 
meaning on, 45 
motivational factors in, 48 
practice conditions on, 
45-46 
reminiscence on, 46-47 
testing conditions on, 45- 
46 


warm-up activity on, 51 

see also Behavioral devel- 
opment; Drive; Extinc- 
tion; Group processes, 
group learning; Latent 
learning; Perception; Re- 
inforcement; Retention; 
and Speech, development 
of 


M 


Manic depressives, see Be- 
havior abnormalities, 
manic depressives 

Maturation 

appreciation, 362 
drawing, 362 
psychotherapy and, 326 
on reading, 360 





reflexes and, 2 

sequence, 2 

see also Behavioral develop- 
ment 


Memory 


neural organization, 274 
of neurotics, 266 
see also Retention 


Mental disorders, see Be- 


havior abnormalities 


Minnestoa Multiphasic Per- 


sonality Inventory 
academic achievement and, 
346-47 
anxiety items, 150 
authoritarian personality 
and, 197 
Caudality Scale, 305 
on children, 301 
in conditioning studies, 35 
cross-validation check, 345 
delinquency and, 301 
family attitudes on, 8 
identification and, 162-63 
liberal arts students and, 
347 


medical students and, 346- 
47 

in prediction setting, 345 

as prognostic indicator, 276 

practice teaching and, 300 

research review of, 345 

Rorschach test and, 345 

self-concept and, 340 
Motivation 

in academic achievement, 


ambivalent stimulation, 264 
in discrimination learning, 
34 


incentive, in learning, 27-28 

insulin-induced hunger as, 
320 

interests as, 343-44 

neural organization, 274 

noxious stimulation, 253-64 

in psychotherapy, 342 

in speech learning, 10 

Strong Vocational Interest 
Blank and, 344 

on study habits, 342 

in visual perception, 76-77 

see also Drive; and Learn- 
ing, motivation in 


N 


Neuroses, see Behavior ab- 
normalities, neuroses 


fe) 


Odor 


see Chemical senses, odor 


Olfaction 


see Chemical senses, olfac- 
tion 
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P 


Perception 
cerebral palsy on, 19 
early learning on, 29-30 
hunger and, 259 
psychotic process on, 272- 
13 
social, 195-97 
authoritarian personality 
and, 196-97 
empathy in, 196 
Honi phenomenon, 197 
projection in, 195-96 
thresholds 
practice on, 30 
training experiemnts in, 30 
see also Learning; and 
Visual perception 
Personality, 149-67 
age trends, 6 
aggression, 161-62 
anxiety 
general scale of, 150-55 
in testing situation, 155-59 
authoritarianism 
measurement of, 131 
perception and, 196-97 
prejudice and, 194 
basic traits 
factor analysis and, 166- 
67 
categories of, 149 
dimensions of, 130-32 
complexity-simplicity, 286 
criterion groups of, 165-66 
empirical tests and, 284 
neuroticism, 285-86 
psychotic, 165-66, 285 
schizothymic-cyclothymic, 
165-66 
dynamics of 
increased knowledge of, 340 
ego defense mode, 167 
emotional adjustment 
self-acceptance and, 340 
evaluation of, 281 
factors of 
ethnic attitudes and, 193 
genetic, 285-86 
identification, 162-63 
intellectual performance 
psychodynamic influences 
on, 150-61 


166-67 
intraindividual variation in, 
166-67 
job satisfaction and, 215 
measures of, 150-51 
factor analysis of, 166-67 
primary variables, 285 
sociopolitical attitudes and, 
191 
validity of, 291-92 
organization, 4-6, 288 
performance level 


SUBJECT INDEX 


eyeblink conditioning and, 
151 


frustration on, 159-60 
physique on, 5-6 
pleasure-oriented, 208 
political participation and, 192 
productivity and, 288 
psychodynamics and per- 

formance, 150-61 
miscellaneous studies, 
160-61 
responses to frustration, 
159-60, 161 
self-concept theory 
emotionality and, 340 
structure, 163-67 
testing 
social class on, 137-38 
trait concepts 
classification of, 285 
criterion analysis, 165-66 
evaluation of, 164-65 
identification of, 285 
objective measurement of, 
285, 306 

oral character, 164 

psychodiagnostic categories 
and, 165-66 

psychosexual development 
and, 163-65 

situation and, 295 
trait persistence, 4-6 
variables 

in situation tests, 294-95 
work adjustment and, 288 
work oriented, 208 
see also Anxiety; Behavioral 

development; Individual 
differences; and Minne- 
sota Multiphasic Person- 
ality Scale 

Physiological psychology, 247- 
60 


ACTH, 235, 257, 267-70 
adrenals 
electroshock convulsions 
on, 256-57 
adrenocortical activity 
goal-performance divergence 
and, 257 
adrenocortical mechanisms, 
256-58 
biochemical correlates of 
behavior, 255-56 
brain damage and behavior, 
251-55 
— problems and, 258- 


electrical skin activity, 260 
hunger 


hormone theory of, 258 
perception and, 259 
muscle contraction 
cortisone on, 256-57 
palmar skin resistance, 260 
sensory procesees, 233-34, 
248-50 
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skin temperature changes 
recording of, 260 
x-irradiation, 250-51 
on brightness discrimina- 
tion habit, 250 
—_ environment and, 


on metabolism, 250-51 
on nervous system, 250 
see also Animal psychology; 
Behavior; Brain; Com- 
parative psychology; 
Vision; and Visual percep- 
tion 


Prejudice, 193-95 


authoritarian personality, 
194 


contact and, 194-95 

correlates of, 193-94 

determinants of, 193-94 

expressed attitudes 

overt behavior and, 195 

learning of, 11 

among white and Negroes 

see also Children, ethnic 
attitudes of 

Psychiatry 

behavior biodynamics and, 

264 


insight in, 270 
interview, 287 
preventive, 268 
Psychoanalysis 
ahistorical method, 317 
of children, 269 
client - centered therapy and, 
314-15 
free association 
limitations of, 317 
group, 317-18 
historical method, 317 
neurophysiological process 
and, 274 


orientation expansion, 317- 
18 


peptic ulcer and, 325 

psychosexual development 
theory, 163-65 

research motivation in, 333 

resistance in, 314-15 

as science, 332 

task of, 333 

trait concepts, 163-65 

transference in, 314-15 

see also Behavioral abnor- 
malities, therapy; and 
Psychotherapy 

Psychodiagnosis 

multiple, 291 

new concepts, 264-66 

prognosis, 276 

of schizophrenia, 275 

standard clinical tools and, 
274-75 

symptom ratings, 275 

see also Assessment; Be- 
havior abnormalities, 
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diagnosis; and Tests 
Psychoses, see Behavior ab- 
normalities, psychoses 
Psychosomatics 
age and, 267-68 
animal studies, 263. 
asthma, 325-26 
breast cancer and, 270 
bronchial asthma, 268 
cardiac neuroses, 267 
childhood experiences on, 268 
diabetes, 325-26 
hypertension, 325-26 
irritable colon, 325-26 
peptic ulcer, 325-26 
therapy and, 270, 325-26 
ulcerative colitis, 325-26 
urticaria, 325-26 
vasomotor rhinitis, 325-26 
see also Behavior abnor- 
malities 
Psychotherapy, 311-34 
changes during 
categories for, 311-12 
determination of, 326 
discomfort relief quotient, 
312, 313 
palmar sweat, 313 
P-technique, 313 
verbal behavior, 313 
client-centered 
definition of, 316 
expositions of, 340 
involuntary patients and, 
315-16 
psychoanalysis and, 314-15 
studies of, 312, 340, 341 
client-therapist interaction, 
314-15 
cultural differences on, 
321 
factors in, 340 
conceptual matrix, 312 
continuation-termination of 
Rorschach test, 296-98 
counseling and, 337 
different orientations in 
common factors in, 318 
expansion, 317-18 
experimental study of, 313- 
14 
interpretation in, 314-15 
new therapy knowledge 
from, 315-16 
different types of 
common elements in, 324-25 
evaluation of, 326 
outcome of, 323-24 
dynamic, 270 
electroshock convulsions, 
237-39 
on adrenals, 256-57 
see also Behavior,electro- 
shock convulsions on 
“equivalence of stimuli, ”’ 
325 
evaluation of, 275-76 
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behavior rating scales, 
275-76 
Freud’s concept of 
emendations of, 319 
generic, 270 
group, 272, 276-77, 328-32 
college reading difficulties, 
360 
Industrial Neurosis Unit, 
331-32 
new therapist on, 331 
outcome, 331 
problems of, 329-30 
psychoanalytic approach, 
317-18, 328 
special project in, 329-31 
hypnotic interpersonal re- 
lationship, 321 
individual 
outcome, 322-26 
process, 311-22 
insight in, 270 
interpretation 
nature of, 314 
training frame of reference 
on, 314 
interviewing 
dynamics of, 349 
involuntary clients, 315-16 
learning theory and, 318-19 
long-term 
brief intervention on, 327 
multiple therapy and, 327 
outcome of, 324 
military, 271-72 
multiple therapy, 327-28 
advantages of, 327 
long-term therapy and, 327 
neutral, 270 
outcome, 296-98, 322-26 
basic conditions of, 342 
client’s extraclinical be- 
havior, 341 
criteria for, 322 -24 
long-term therapy, 324 
patient change and, 326 
prediction in, 296-98, 311 
purposeful activities, 331 
in schizophrenics, 324 
in verbal relationships, 331 
patient evaluation, 296-98 
peptic ulcer and, 325-26 
pretherapeutic procedures, 
315-16 
process of, 311-22 
client-therapist interaction, 
314-15 
discomfort-relief quotient, 
312 
hypothetical model for, 318- 
19 


learning theory and, 319 

methodological approaches, 
312-13 

palmar sweat, 313 

P-technique, 313, 341 

Q-technique, 312, 341 








skeletal muscle action 
potentials, 313 
study methods of, 329 
verbal behavior, 313 
psychoanalytic techniques, 
270 
psychologic factors in, 274- 
76 


repression 
against superego, 319 
research in, 332-33 
resistance in, 313-16 
as resocialization process, 
318 
roots of, 322 
of schizophrenics, 316 
self-references, 312 
‘*separation reactions, ’’ 270 
social factors in, 321 
as social learning, 318, 328 
team-work approach, 268 
techniques of, 268-69, 313- 
14, 322 
tension factor in, 341 
therapist in 
action categories, 317 
behavior of, 313-14, 321, 
339 
colleagues and, 322, 327 
for family, 321-22 
in nondirective therapy, 
317-18 
psychoanalysis of, 328 
rated competence of, 338 
responses of, 313-14 
social factors of, 321 
training of, 314, 327-28, 
333-34, 338 
transference in, 314-15 
transference relations in, 
330 
verbal participation in, 312 
see also Behavior abnor- 
malities, therapy; Counsel- 
ing; and Psychoanalysis 


R 


Recruitment 


see Hearing, loudness re- 
cruitment 


Reinforcement 


amount of, 36 

automatic effects of, 47 

continuous, 33 

delay, 264 

differential and partial, 32- 
33 

frequency of, 27-28, 38-39 

partial 

in stereotyped reactions, 

263 


pattern of, 38-39 
primary, law of, 22 
secondary, 30, 39-40 
time delays in, 39 
type of, 36 
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see also Extinction 
Research design 
see Statistical methods 
Retention 
anoxia on, 232, 248 
in bilateral transfer, 50-51 
ego defense in, 48 
electroshock convulsions on, 
237-38, 239 
failure on, 48 
group, 49 
intensity on, 49 
in ipsalateral transfer, 50- 
51 
motivational factors in, 48 
in motor learning, 46-47, 
50-51 
practice distribution on, 50 
social attitudes on, 48-49 
stress on, 48 
in verbal learning, 46-47 
see also Extinction; and 
Memory 
Rorschach Test 
on anxiety, 150, 160 
child adjustment estimate, 
9 


on children, 303 
in counseling evaluation, 342 
critical evaluation trend, 
302 
diagnostic categories and, 
302 


group, 345 
in group therapy project, 
330 


Harrower group, 300 

on leaders, 294 

in learning studies, 17 

Minnesota Multiphasic Per- 
sonality Inventory and, 
345 


SUBJECT INDEX 


adolescent psychopaths, 305 
anxiety neurotics, 305 
social perception and, 195 
stress and, 160-61 
stutterers, 305 


s 


Schizophrenia 


attention disturbances study, 
273 

biochemical factors in, 255- 
56 


catatonic 
drugs on, 267 
criterion analysis of, 165- 
66 


diagnostic procedures, 275 

“frustration behavior, ’’ 319 

group therapy, 329-31 

hyperglycemic factor, 255- 
56 


lactic acid values, 255 

learning in, 273, 319-21 

leucotomy on, 316 

in military service, 271 

mother attitudes and, 276 

occupational therapy, 319- 
20 


“oneirophrenics,’’ 256 
problem solving guidance on, 
319-21 

proteolytic enzyme, 256 
psychotherapy, 316 
psychotherapy outcome, 324 
reactions, 266 
regression in 

measure of, 304-5 
Rorschach test and, 275, 302 
sodium benzoate on, 267 
symptom ratings, 275, 292- 


93 
in patient evaluation, 297-98 Sensory processes 


performance 
card order on, 302 
color on, 302 
examiner on, 302 
testing situation on, 302 
physiological activity and, 
258 
practice teaching and, 300 
predictive validity of, 297- 
98, 299 
a8 prognostic indicator, 276, 
298 


psychotherapy outcome 
prediction, 297-98, 311 

psychotic process and, 275 

on schizophrenics, 275 

scoring categories, 302-3 

sex on, 13 

— progress and, 


validity of, 303 
variables, 302-3 
Rosenzweig Picture Frustra- 
tion Test 


comparative psychology and, 
233-34 


palmar resistance 
measurement of, 260 
peripheral sensory stimula- 
tion, 254-55 
physiological psychology and, 
248-50 


tactual discrimination 
ablation on, 252 
see also Animal psychology, 
sensory processes; Chem- 
ical senses; Hearing; 
Taste; Vision; and Visual 
perception 


Smell 


see Chemical senses, olfac- 
tion 


Social behavior 
J-curve 


criticism of, 175 


neurosis on, 264 
prefrontal lobotomy on, 254 
psychiatric conception of, 
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173 

team research in, 173 

of vertebrates, 173 

see also Behavior; Group 
processes; and Social 
psychology 


Social psychology, 171-98 


atomic age and, 197-98 

attitude scales and scaling, 
187-88 

communication and rumor, 
189-90 

comparative psychology and, 
173 


cultural relativism, 172 
ethnocentrism and prejudice, 


fundamental problems of, 
172 


Gestalt orientation, 172 
group interaction 
psychological basis of, 172 
“‘group mind’’ concept, 172 
group processes and,171-98 
intergroup contact 
racial attitudes and, 194 
international conflict, 197- 
98 
international co-operation 
impediments, 197 
opinion-attitude change, 184- 
87 


opinion-attitude methodology, 
187-89 

other disciplines and, 172- 
73 


political participation, 192- 
93 


determinants of, 192 
personality correlates and, 
192 


voting, 193 
psychological assumptions, 
172 


psychosis and, 276-77 
social perception, 195-97 
social therapy, 276-77 
social value, 172 
sociology and psychology 
variables in, 173 
sociopolitical attitudes and 
participation, 190-93 
survey methodology, 188-89 
field coding, 188 
fixed-alternative question- 
naire, 188 
interviewer bias, 188 
office coding, 188 
open-ended interview, 188 
sampling, 188 
telephone interviews, 188 
validity of, 188-89 
— treatment of, 172- 


trends in, 171 
see also Attitudes; Communi- 
cation; Group processes; 
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Leadership; Perception, 
social; and 
Social behavior 
Speech 
development of 
family on, 9, 10 
institutions on, 9 
motivation in, 10 
motor activities and, 9 
siblings on, 11 
social class on, 8 
discriminative stimulus, 
401 
distinctive features theory, 
401-2 
perceptual judgments of, 
402-3 


tests for hearing, 107 
see also Communication 
Stanford Binet 
for mental defectives, 142, 
370-71 
Terman-Merrill revision, 
370-71 
Wechsler Intelligence Scale 
for Children and, 19-22 
Statistical methods 
accident proneness 
binomial correlation 
methods, 385 
**Poisson fit’’ technique 
advance in, 377 
aptitude tests reliability 
coefficients, 385 
block designs, 393 
chi-square test 
bell-shaped curves and, 
394 
expectancies in, 394 
history of, 393 
item analysis, 394-95 
supplementary test to, 394 
tables extension, 395 
use of, 394 
comparative judgment law, 
397 


correlation, regression, 
and selection, 381-86 
cross-validation problems, 
382 
experimental designs, 393 
greco-latin designs 
in learning studies, 392-93 
independent variables 
measurement, 385-86 
item difficulty distribution 
discrimination and, 380 
scaling and, 380 
on score-ability correla- 
tions, 377-78 
selection ratio and, 379 
on test efficiency, 378-79 
joint and compound probabili- 
ties, 395 
learning studies 
greco-latin designs, 392-93 
practice effects, 393 
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multiple correlation coeffi- 
cient, 381-83 

personnel classification, 
386-89 

personnel classification 
problem 

*‘optimal regions’’ method, 

389 


Rao’s lemma, 387-88 
simplex methods, 387-88 
solutions, 386-88 
product moment validity, 385 
randomized designs, 393 
regression equation 
formula for, 381-82 
reliability equation, 377-78 
reliability problem, 384 
research design and, 377-97 
scaling, 380, 396-97 
score-ability correlation 
biserial, 378 
bivariate frequency 
surface, 379 
curvilinear, 377-78 
formulae for, 379-80 
full scatter plots, 380 
score-items correlations 
formulae, 379-80 
sequential tests, 385 
short-cut methods, 382-83 
significance tests, 393-96 
Status indexes, 397 
systematic designs, 393 
test items selection 
sequential analysis of 384 
test theory, 377-81 
textbooks, 397 
truncated distribution, 396 
variance and related designs 
analysis, 391-93 
variance designs 
confounded, 391-92 
F ratios, 392 
higher order interactions, 
392, 393 
ttests, 391 
Wherry’s shrinkage formula, 
381 


see also Factor analysis; and 
Tests 


Stress 


on behavior, 241-44 

disorders, see Psychosoma- 
tics 

on retention, 48 

Rorschach test and, 160-61 

see also Anxiety 


Strong Vocational Interest 
Blank 


A, 4 


ic achiev it, 343- 





44 
as drive measurement, 344 
engineer interests, 208 
faking in, 1382, 208 
identification and, 163 
liberal arts students and, 
347 


occupational level score, 
344 

Physician Scale revision, 
343 

scholastic prediction and, 
346 


validity of, 299, 305 
veterans, 350 


T 


Taste 
adaptation 
in insects, 115-16 
site of, 116 
bilateral lesions on, 112 
discrimination 
specific cortical area 
role in, 112 
flavor testing, 116 
primary 
individual reactions to, 
114-15 
receptors, 111 
sensations 
fusion frequency of, 112 
sensitivity 
chorda tympani role in, 
112-13 
diurnal variations on, 114 
glossopharyngeal nerves 
in, 112-13 
temporal lobectomy on, 112 
thresholds in 
adrenalectomy on, 257-58 
deprivation on, 113-14 
monosodium glutamate on, 
115 
need on, 113 
see also Chemical senses, 
gustation 
Tests 
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reliability coefficients of, 
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specific training and, 138 
California Psychological 
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cross-validation, 382 
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socioeconomic differen- 
ces in, 137 
standardization of, 141- 
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Leiter International Per- 
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Malamud-Sands Rating 
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mechanical comprehension, 
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Meier Seashore Art Appre- 
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objective 
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clinical literature and, 290 
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objection to, 291 
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validity of, 291-92 
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Saslow Screening Inven- 
tory, 287 
scores 
ability and, 377-81 
frequency distribution, 
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SRA Youth Inventory, 345 
social class on, 7 
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factor analysis of, 132, 
345 
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Terman-Miles Masculinity- 
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status, 283 
in trainability prediction, 
133 
Wechsler-Bellevue sub- 
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ture-Frustration Test; 
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Thematic Apperception Test 
achievement motivation, 347 
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multiple discriminant 
function, 141 

predictive validity, 299 
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Vision, 57-77 
basic psychophysical func- 
tions of, 57-65 
binocular equivalence ,69-70 
binocular rivalry, 69-70 
color, 62-65 
adaptation on, 62-63 
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defects in, 63 
in Drosophila, 233, 249 
field size on, 63 
hue discrimination, 
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receptor distribution pat- 
terns and, 64 
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Visual perception 
absolute thresholds, 58-59 
dark-adaptation, 59 
accomodation responses 
age on, 61 
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drugs on, 69 
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neural mechanisms of, 61 
acuity in, 60-62 
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dark adaptation, 61-62 
development of, 66-67 
night, 61-62 
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psychological factors in, 
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pupil size on, 61 
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binocular flicker, 60 
detection in, 75-76 
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thresholds, 59-60 
Bunsen-Roscoe Law, 60 
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cortical lesions on, 252 
cue reversal on, 33 
learned nature of, 30 
x-irradiation on, 250 
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flicker-fusion thresholds, 
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form perception, 72-73 
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outdoor visibility, 75 
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cortical lesions on, 253- 
54 

phylogenetic aspects of, 
71 


thresholds for, 70-71 
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scotopic sensitivity 
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see also Perception; and 
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